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Power struggle 


The UK government’s decision to subsidize a nuclear power station while cutting support for 


renewables is short-sighted. 


eternal delight”. But then poets have rarely been charged with 

keeping the lights on. Some 200 years later, energy — and 
how to produce and harness it — is on track to become the defining 
problem of a generation. 

The latest dark satanic mill on the horizon in Blake’s green and 
pleasant land is a shiny new nuclear power station — confirmed 
last week after years of plotting. It will be the first built in the United 
Kingdom this century, and is one of the most significant nuclear deals 
worldwide since the meltdown at the Fukushima Daiichi nuclear 
power plant in Japan in 2011. 

As Chinese President Xi Jinping — in the United Kingdom for a 
state visit — was paraded arounda series of picture-book British loca- 
tions, it was announced that China was taking a 33.5% stake in the 
Hinkley Point C nuclear plant. French power company EDF will own 
the remaining 66.5%. 

Supported by billions of pounds of Chinese investment, the plant 
should provide 3.2 gigawatts of power when it fires up as planned in 
2025. But never mind the output: feel the cost. 

The UK government has agreed a price with the investors of at least 
£89.50 (US$137) in 2012 terms for every megawatt hour of power 
produced by the plant. This is roughly double the current market cost, 
but the government claims — with a little poetic licence of its own — 
that it “is competitive with other large energy sources such as gas and 
offshore wind”. 

The Conservative government also noted that this would mean 
abandoning the policy of several previous administrations that there 
should be no public subsidy for new nuclear power. 

In reality, it had little choice. (The previous administrations had 
no problems sticking to the line, because they had no new nuclear 
power to subsidize.) Nuclear power plants are among the single most 
expensive items that governments can buy, and as Britain has allowed 
its home-grown nuclear expertise to dwindle, so it has lost the chance 
to bury the exact cost in a tangle of public expenditure. Whether the 
money comes directly from the public as artificially high electricity 
bills, or indirectly through the public purse, British politicians last 
week admitted that the technology simply cannot pay for itself. 

In doing so, they have been criticized — there are plenty of oppo- 
nents who argue, with some justification, that this nuclear deal is a poor 
one for Britain. But in a way, the new-found candour about the costs 
does at last make the debate about energy a more honest one. Most 
power industries, to a greater or lesser extent, need a financial leg-up. 

Hinkley Point C will cost at least £18 billion. And that does at 
least buy a reduction in greenhouse-gas emissions compared with 
one carbon-heavy alternative for baseload generation: coal. But the 
UK government has done little to stress that point — it is hard for it 
to do so while systematically cutting off other low-carbon forms of 
electricity at the knees. 


Te: English poet William Blake once wrote that “Energy is an 


Last year, renewable energy sources supplied a record 19% of UK 
electricity (the global figure is around 22%), but the same govern- 
ment that is committed to subsidizing Hinkley Point C has set out to 
slash subsidies for renewables. Trade associations say that thousands 
of jobs could go. Businesses face collapse as sections of the solar indus- 
try deal with 87% cuts in their subsidies, and onshore wind is under 
threat. Perversely, ministers justified the sacrifice of these growing 

clean-energy sources by pointing to increas- 


“Most power ing public support for them. 

industries, to The United Kingdom is far from alone in 
a greater or struggling to balance short-term financial 
lesser extent, prudence with long-term environmental 
needa financial protection. Earlier this month, the Interna- 
leg-up.” tional Energy Agency reported that renew- 


able energy accounts for 45% of new power 
capacity added globally in 2014. But it warned that the rate of growth 
for renewables was slowing because of policy problems and uncertain- 
ties — especially in Europe and Japan. 

UK energy and climate-change minister Andrea Leadsom used the 
ugly word “trilemma” last week to describe the energy issue facing the 
United Kingdom: the need to reduce carbon emissions, to maintain 
supplies and to keep bills down. 

There is another trilemma, popular among project managers: the 
need for any project to be good, fast and cheap. The classic response 
could have been written for energy policy: ‘pick two. = 


Burst bubbles 


Two medical-technology companies illustrate 
the ups and downs of innovation. 


is challenged by upstarts. Often backed by entrepreneurs and 

investors, these firms promise to use new technology to disrupt, 
overturn and revolutionize. Some succeed and some do not, and there 
are fields in which the challenge to newcomers is proving stiffer than 
others. One of these is health care, and events over the past week or so 
demonstrate both the difficulties and the opportunities. 

Theranos and 23andMe are two medical-technology companies with 
their origins in Silicon Valley. Both have made headlines recently. Their 
stories may seem similar. But the differences offer an important lesson 
for would-be health disruptors: this industry can change, just not as 
quickly as entrepreneurs and their investors might hope, and only if 
those offering the change can also offer data to back up their claims. 
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Theranos in Palo Alto, California, promised to upend medicine with 
a device that can perform hundreds of diagnostic tests on just a few 
drops of blood. 23andMe, in Mountain View, California, sells genetic 
tests directly to consumers. Both are led by charismatic female found- 
ers: Elizabeth Holmes at Theranos and Anne Wojcicki at 23andMe. 
Both want to revolutionize the health-care industry and argue that 
patients should have access to their data. They have strong backing 
from Silicon Valley investors, and were hyped early on: a US$9-bil- 
lion valuation for Theranos, and lavish parties with media tycoons 
for 23andme. 

But both have seen their bubbles burst. On 16 October, The Wall 
Street Journal reported that the Theranos technology was not work- 
ing as billed, and that the firm was using conventional machines to 
perform most of its tests. The company has disputed some of the 
article’s claims. Holmes says that the company is now in a “pause 
period” because of scrutiny from US regulators. 

23andMe’s bubble burst in November 2013, when the US Food 
and Drug Administration (FDA) banned the inclusion of medi- 
cally relevant results in the company’s consumer genetic tests. On 
21 October, however, 23andMe relaunched consumer genetic tests that 
give a limited amount of medical information, with FDA approval. The 
new tests offer information for 36 diseases about a customer's status as 
a ‘carrier’ of genetic glitches that could cause disease if passed down 
through their children. 

Theranos could learn alot from how 23andMe returned to the regu- 
lators’ good graces. 23andMe has always been fairly open about its 
science; it publishes research papers in peer-reviewed journals and 
collaborates with scientists. Theranos, by contrast, has been tight- 
lipped about its data. Apart from detailed data for one herpesvirus 
test, approved by the FDA in July, the company has published only 
aggregate test performances on its website, not the primary data. 

23andMe says that coming back from its early mistakes with 


the FDA was an arduous process — requiring it to hire staff with 
expertise in health regulation and to compile detailed dossiers of data 
to prove that its tests work as advertised. The company previously 
had been slow to respond to the FDA’ entreaties — and that tone- 
deafness seems to have been part of the reason that the agency eventu- 
ally cracked down. 

These experiences do not mean that health care cannot be 
disrupted. Indeed, 23andMe is the first company to gain FDA 
approval to sell a health-related genetic 


“Time and test to consumers without a doctor’s order. 
again, new That’s a real change. 

health-care Still, the new tests offer less information 
firms areforced  thanbefore andata higher price. With a few 
torealizethatit exceptions, carrier tests do not say anything 
helps no one to about the health of the individual tested, and 
be secretive with they are mainly for rare diseases — a far cry 
data.” from the risk-prediction scores the company 


previously offered for cancer and Alzheimer’s 
disease. And the new test package costs US$199 compared with $99 
before the ban. 

23andMe has also moved to make itself into a more conventional 
pharmaceutical firm. In March, it hired former Genentech executive 
Richard Scheller to lead a drug-development arm. If you can't beat 
them, join them. 

Time and again, new health-care firms are forced to realize that it 
helps no one to be secretive with data. Even if it turns out that the 
Theranos technology does not work as well as advertised, the company 
would hardly be the first to find itselfin that situation. Releasing more 
information earlier might have forced Theranos to confront shortcom- 
ings. Instead, it finds itself trying to recover from a regulatory and pub- 
lic-relations hole. This is not an insurmountable situation, as 23andMe 
knows. The challenge now is for Theranos to show us the data. m 


Ghost story 


The problem of abandoned fishing gear and its 
effects on marine life deserve greater attention. 


across the world, fish, crabs and other marine life are being 

drawn into nets and traps by the dead and decomposing 
bodies of their comrades. There they will stick, struggle and tangle, 
until they, too, become unwitting bait and continue the circle of death. 
Old fishing nets, you see, never die. They just drift away. 

The problem of ghost fishing, as it is known, goes largely unnoticed, 
but some of this dead gear catches and kills more sea life than it did 
when it was alive and in active use. Reliable data on the scale of the 
problem are scarce, but some estimates suggest that the nets can 
remove up to 30% of the landed catch of certain fish species. 

It is said that we know more about the surface of the Moon than 
about the bed of the sea. Perhaps we are afraid of what we will find 
there if we look too hard: wrecks of gill nets, entangling nets, pelagic 
and demersal longlines, lobster and crab pots, seine nets, trawl-net 
fragments and the sinister-sounding fish aggregating devices — buoys 
or floats, tethered to concrete blocks, around which fish tend to 
congregate. 

Some of this fishing gear is lost and some is abandoned in rough 
weather. Much is simply discarded by fishers with nowhere to stow it, 
who are fishing where they should not be, or who just want to avoid the 
expense and hassle of disposing of it properly. Most of this gear sinks 
to the bottom. It becomes a hazard, to boats and divers. And much of 
it continues to catch and kill, long after it has been forgotten. 
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Take the coastline of Louisiana, a US state that is home to its fair share 
of spooky tales. Each of the 1,800 or so professional crab fishers who 
work there loses about 250 traps every year. Each abandoned trap, a 
crude wire cage, is reckoned to catch and kill a blue crab (Callinectes 
sapidus) every two weeks. That is 12 million crabs a year, or 2 million 
kilograms of crab meat — about US$4-million worth — along a single 
stretch of coastline (J. A. Anderson and A. B. Alford Mar. Pollut. Bull. 
79, 261-267; 2014). Ghost crab traps snare other creatures too: spotted 
sea trout, diamondback terrapins and river otters among them. 

Although the world organizes regular conferences to address the 
threat of orbiting space junk, action on the danger of ghost fishing tends 
to be left to volunteers. Louisiana law allows a ten-day period each year 
when citizens can drag derelict fishing gear from the water. In two ses- 
sions — 2012 and 2013 — volunteers recovered a total of 3,607 ghost 
crab traps. More than 65% of them had caught something. The actions 
of such volunteers are admirable but they are not enough. The fishing 
industry and those who profit from it must take more responsibility. 

Earlier this year, Eric Gilman, a fisheries scientist at Hawaii Pacific 
University in Honolulu, published a survey of international efforts to 
track and control ghost fishing (E. Gilman Mar. Policy 60, 225-239; 
2015). Of the 19 global and regional bodies (from the International 
Whaling Commission down to the South East Atlantic Fisheries 
Organization) that he identified as being in a position to intervene, 
just 4 had an explicit mandate to monitor and reduce the problem. 
Almost half did not even collect data on lost gear. The 12 organiza- 
tions that have introduced measures to help prevent and reduce ghost 
fishing have not used all the options available to them. 

All ghost stories are more chilling in the dark. 
The problem of abandoned, lost and discarded 
fishing gear deserves more attention and more 
action. For unlike many gruesome stories you 
will hear this weekend, this one is true. m 
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concerns about the accuracy of the statistical interpretation of 
DNA evidence, and it is now checking whether convictions 
going back more than a decade are safe. 

Despite how it is often portrayed, in the media and in courts, 
the forensic science of DNA is far from infallible. Particularly con- 
cerning is that police and prosecutors now frequently talk of ‘touch 
DNA’ — genetic profiles of suspects and offenders that have been gen- 
erated in a laboratory from just a handful of skin cells left behind in 
a fingerprint. 

Research done by me and others at the University of Indianapolis in 
Indiana has highlighted how unreliable this kind of evidence can be. We 
have found that it is relatively straightforward for an innocent person's 
DNA to be inadvertently transferred to surfaces 
that he or she has never come into contact with. 
This could place people at crime scenes that they 
had never visited or link them to weapons they 
had never handled. 

Such transfer could also dilute the statistics 
generated from DNA evidence, and thereby ren- 
der strong genetic evidence almost insignificant. 
(This issue is the focus of the Texas investigation.) 

We urgently need to review how DNA evi- 
dence is assessed, viewed and described. Every- 
one in the medico-legal community — forensic 
scientists and technicians, DNA analysts, poten- 
tial jurors, judges and lawyers for both the 
prosecution and defence— must know and 
understand the potential for mistakes. 

The term ‘touch DNA conveys toa courtroom 
that biological material found on an object is the result of direct con- 
tact. In fact, forensic scientists have no way of knowing whether the 
DNA was left behind through such primary, direct transfer. It could 
also have been deposited by secondary transfer, through an interme- 
diary. (If 1 shake your hand then I could pass some of your skin cells 
onto something that I touch next.) 

Contamination from secondary DNA transfer was raised as 
a possible problem in Nature in 1997 (R. A. H. van Oorschot and 
M. K. Jones Nature 387, 767; 1997). It is known to happen, but has 
largely been dismissed by legal experts as being rare outside the con- 
ditions of a laboratory. Experiments done in real-world conditions 
seemed to support this, and concluded that secondary DNA transfer 
would have little impact on interpretation of the genetic profile. 

Itis important to recognize that DNA amplification kits have become 
much more sensitive than they were in the past. 


| \ arlier this month, the Texas Forensic Science Commission raised 
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Forensic DNA evidence is 
not infallible 


As DNA analysis techniques become more sensitive, we must be careful to 
reassess the probabilities of error, argues Cynthia M. Cale. 


surfaces for otherwise invisible cells left behind, on the handle of a 
weapon or on a windowsill, perhaps, and ask labs to generate a DNA 
profile from them. The new kits can generate a full genetic profile ofa 
suspect from as little as 100 picograms (trillionths of a gram) of DNA. 

These subtleties are not usually explained in court. Instead, a jury 
is told that there is a one-in-a-quadrillion chance that the evidence 
retrieved from the crime scene did not come from a defendant. Natu- 
rally, the jurors assume that the defendant must have been there. 

Given the power of modern forensic techniques to pull a DNA 
profile from a smudge of cells, secondary DNA transfer is no longer 
a purely theoretical risk. In California in 2013, a man called Lukis 
Anderson was arrested, held for four months and charged with murder 
after his DNA was found under the fingernails of a homicide victim. 

Anderson had never met the victim and was 
severely intoxicated and in hospital when the 
man was killed. The same paramedics who took 
Anderson to hospital responded to the murder. 
Most likely, the paramedics were covered in 
Anderson’s DNA, which they then inadvertently 
transferred. The charges were dropped. 

Experiments in our labs, under the supervi- 
sion of forensic anthropologist Krista Latham, 
show how easily DNA can be transferred 
to an object. 

We asked pairs of people to shake hands for 
two minutes and then each individual handled 
a separate knife. In 85% of cases, the DNA of the 
other person was transferred to the knife and 
profiled. In one-fifth of the samples, the DNA 
analysis identified this other person as the main 
or only contributor of DNA to the ‘weapon (C. M. Cale et al. J. Forensic 
Sci. http://doi.org/8j2; 2015). 

How significant is the result ofa single study? Other analyses have 
shown that DNA transfer can be unpredictable and can depend on 
environmental conditions. We need more research on when and how 
secondary transfer can occur. 

At the very least, the results highlight again that samples at crime 
scenes must be gathered with great care. DNA can persist on latex 
gloves, so they must be changed — or bleached — before and after 
handling evidence. 

Even apparently rigorous evidence such as DNA profiles can be 
interpreted in multiple ways, some of which will be incorrect. As the 
technology to generate these profiles continues to accelerate, so must 
our efforts to sift out possible mistakes. m 


Cynthia M. Cale is a human-biology graduate student at the 
University of Indianapolis, Indiana, and lead forensic DNA analyst III 
at Strand Diagnostics. 

e-mail: ccale@strandlabs.com 
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Selections from the 
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RESEARCH HIGHLIGHTS 


Alzheimer’s clue 
from spatial test 


Young adults who are at 
increased risk of developing 
Alzheimer’s disease show 
abnormal function ina part 
of the brain involved in spatial 
navigation. 

Nikolai Axmacher at 
Ruhr University Bochum in 
Germany and his colleagues 
used neuroimaging to measure 
the functioning of the ‘grid- 
cell’ system in the entorhinal 
cortex as participants with or 
without the APOE-e4 risk gene 
navigated a virtual arena. 

The 38 high-risk individuals 
showed reduced grid-cell 
functioning compared with the 
37 controls and tended to avoid 
the centre of the arena. Activity 
in the hippocampal region 
was higher when the grid-cell 
system was impaired, as if to 
compensate for the deficit. 
Reduced grid-cell functioning 
may prove useful as an early 
biomarker of Alzheimer’s, the 
authors suggest. 

Science 350, 430-433 (2015) 


Red-giant rogue in 
Andromeda 


Astronomers have spotted a 
giant ‘runaway star’ speeding 
through the Andromeda 
galaxy; the first of its kind seen 
outside the Milky Way. 

Whereas most stars flow 
together around the centre of 
their galaxy, some, known as 
runaways, travel at different 
directions and speeds to their 
peers, and may even escape 
their galaxy entirely. Spotting 
red-supergiant runaways is 
rare — stars in this late stage 
of life left their birthplace long 
ago, making their unusual 
speeds harder to spot than for 
younger runaways. 

Kate Evans and Philip 


ZOOLOGY 


Bright light as sex signal 


Brighter female glow-worms lay more eggs than their dim 
rivals and are more attractive to potential nocturnal mates. 

Juhani Hopkins at the University of Oulu in Finland and 
his colleagues allowed 26 female glow-worms (Lampyris 
noctiluca; pictured) to mate in the lab. The glowing lanterns 
of the insects varied in size from 7 square millimetres to 
19 square millimetres — larger lanterns produce a brighter 
glow. Each glow-worm laid between 25 and 195 eggs, with 
those perceived by the researchers to be brightest laying the 
most. Male glow-worms presented with fake females also 
preferred those with brighter lights. 

The lanterns of female glow-worms may provide clues 
about fitness to males, who are unable to assess size — also an 
indicator of fecundity — in the dark. 

Biol. Lett. http://dx.doi.org/10.1098/rsbl.2015.0599 (2015) 


Massey at the Lowell 
Observatory in Flagstaff, 
Arizona, studied a 

red supergiant in the 
Andromeda galaxy known 

as J004330.06+405258.4 

and calculated that it is 
travelling 400-450 kilometres 
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per second faster than its 
neighbours. The star is 

the first massive runaway 

to be spotted outside our 
own galaxy and the fastest 
anywhere for its size, say the 
authors. 

Astron. J. 150, 149 (2015) 


MATERIALS 


lron skin senses 
the softest touch 


Aniron-based artificial skin 
can sense the lightest touch. 

Ahmed Alfadhel and Jiirgen 
Kosel at the King Abdullah 
University of Science and 
Technology in Thuwal, Saudi 
Arabia, made a tactile sensor 
by embedding iron nanowires 
in hair-like structures called 
cilia, made ofa polymer called 
polydimethylsiloxane, on a 
magnetic sensor. When the 
magnetized cilia are bent by 
touch, they trigger the sensor. 

The skin can flex and, 
depending on the size of the 
cilia, can be sensitive enough 
to measure a person's wrist 
pulse. Because the cilia use 
permanent magnets rather 
than electromagnets, the 
device does not require much 
power and — unlike most 
other tactile sensors — it can 
work underwater and measure 
liquid flow, the authors say. 
Adv. Mater. http://doi.org/f3jp2n 
(2015) 


Gene regulation 
predates animals 


The oldest ancestor of animal 
life used the same tricks that 
modern humans do to turn 
genes on and off. 

Alex de Mendoza at the 
Institute of Evolutionary 
Biology in Barcelona, Spain, 
and his colleagues studied 
gene regulation in the fungus- 
like single-celled organism 
Creolimax fragrantissima, 
which branched onto a 
separate evolutionary path 
before the evolution of 
multicellular organisms. 

To produce different cell 
types, multicellular organisms 
use three main gene-regulation 
processes: transcription 
factors, alternative splicing and 
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non-coding RNAs. The authors 
found that C. fragrantissima 
uses the same processes to 
switch between life stages, 
meaning that these regulatory 
elements were likely to have 
been used by the last universal 
common ancestor of all 
animals, the authors say. 

eLife http://doi.org/8kh (2015) 


P_ODISEASE 
Plague is an 
ancient pathogen 


Plague was plaguing humanity 
thousands of years earlier than 
previously thought, but in a 
less transmissible form. 
Yersinia pestis bacteria, 
which are thought to have 
been behind the Black Death 
that killed millions in the 
fourteenth century, have 
previously been found in burial 
sites dating back 1,500 years. 
But Eske Willerslev at the 
Natural History Museum of 
Denmark in Copenhagen and 
his colleagues looked even 
further back. They analysed 
DNA obtained from the teeth 
of 101 humans (pictured) 
who died in Europe and Asia 
between 2,800 and 5,000 years 
ago and found Y. pestis DNA 
in seven individuals. Analysis 
of the DNA showed that a 
strain similar to the Black 
Death strains was widespread 
in the Bronze Age, but only 
the more recent strains had a 
gene called ymt, which helps 
Y. pestis to colonize the guts 
of fleas. Without fleas to aid 
transmission, plague spreads 
less efficiently. 
Cell http://dx.doi.org/10.1016/ 
j.cell.2015.10.009 (2015) 


STEM CELLS 


Molecular menu 
creates neurons 


Astrocyte cells in the brain 
can be reprogrammed into 
neurons using a precise 
sequence of molecules. The 
technique may one day be 
useful in brain repair. 

Similar cells have previously 
been reprogrammed into 
neurons using viruses, but 
Gong Chen and Gang- Yi 
Wu at Pennsylvania State 
University in University Park 
and their colleagues now show 
that the transformation can be 
done with small molecules. 

They treated human 
astrocytes with nine different 
molecules in sequence, 
converting them into neurons 
that survived for more than 
five months in culture and 
more than one month after 
transplantation into a mouse 
brain. The method works 
for human brain astrocytes 
but not for human spinal 
astrocytes or mouse astrocytes, 
suggesting that different sets 
of molecules may be needed 
for different astrocytes or 
to obtain different neuronal 
subtypes, the authors report. 
Cell Stem Cell http://doi.org/8m5 
(2015) 


LAB TOOLS 


Superconducting 
sensors warm up 


An extremely sensitive, 
superconductor-based 
magnetic sensor can work 
at around 77 kelvin, a 
temperature achievable with 
liquid nitrogen rather than 
the expensive liquid helium 
required by typical existing 
devices, which operate at just 
above absolute zero. 
Superconducting quantum- 
interference devices (SQUIDs) 
can sense individual 
quanta of magnetic flux by 
measuring voltage induced 
in a loop of superconducting 
material. Boris Chesca of 
Loughborough University, 
UK, and his team connected 
hundreds of loops in series to 
boost the signal. The authors 
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SOCIAL SELECTION 


Popular topics 
on social media 


Backlash over journals blacklisting 


Researchers on social media are split over the decision of 
academic librarian Jeffrey Beall to add the Frontiers journals 
to his ‘blacklist’ of “questionable publishers”. Beall, at the 
University of Colorado Denver, announced the move in 
a tweet, saying that it followed “wide disapproval from 
scientists”. His website Scholarly Open Access maintains a list 
of journals that may be “predatory publishers” — a term Beall 
coined for publications that charge scientists fees to publish 
research but that do not offer services such as peer review, or 
that make misleading claims on impact factors or indexing. 
Critics of Beall’s blacklisting of Frontiers maintain that the 
open-access publisher is reputable and does offer proper 
peer review. Daniél Lakens, an experimental psychologist at 
the Eindhoven University of Technology in the Netherlands 
and an associate editor at Frontiers in Psychology: Cognition, 
tweeted: “Frontiers being added to 


NATURE.COM 
For more on 

popular papers: 
go.nature.com/ch6Gau 


Beall’s list reveals the big weakness of 
Beall’s list: It's not based on solid data, 
but on Beall’s intuition.” Beall told 

Nature that he stands by his decision. 


say that their SQUID design 

is as sensitive as many devices 
already in use and is ready 

for production. The higher 
operating temperature makes 
it ideal for applications such as 
portable magnetic resonance 
imaging machines, they say. 
Appl. Phys. Lett. 107, 162602 
(2015) 


Arctic snow is not 
becoming dirtier 


Dust and soot might not 
be behind the observed 
darkening of the Greenland ice 
sheet (pictured). 

Tiny particles of dirt 
absorb sunlight that would be 
reflected into space by ice — 
contributing to local warming. 
Satellite measurements suggest 
that the amount of sunlight 
reflected by Greenland’s icy 


surface has been decreasing 
since 2001. 

But surveys of the snow 
in northwest Greenland 
conducted in 2013 and 2014 
by Chris Polashenski at the 
US Army Cold Regions 
Research and Engineering 
Laboratory in Fort 
Wainwright, Alaska, and 
his colleagues, found that 
the concentration of dark- 
coloured particles was much 
the same as in previous 
decades. Rather than dirtier 
ice, the declining reflectivity 
seen in satellite measurements 
could be due to a degrading 
sensor on NASA's Earth- 
observing satellite Terra. 
Geophys. Res. Lett. http://doi. 
org/8kg (2015) 
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SEVEN DAYS 


EVENTS 


Afghan quake 

At least 300 people have 

been killed, with the number 
expected to rise, after a 
magnitude-7.5 earthquake 

hit northeast Afghanistan 

on 26 October. The strong 
quake, which according to the 
US Geological Survey struck 
76 kilometres south of the 
city of Fayzabad at a depth of 
around 210 kilometres, caused 
landslides and avalanches 

in the remote Hindu Kush 
region and sent tremors that 
were felt in neighbouring 
Pakistan and as far away as 
Delhi in India. Earthquakes in 
the tectonically active region 
result from the collision of 
the Indian subcontinent with 
central Asia. 


Science subpoenaed 
The chairman ofa US 
congressional committee has 
issued a subpoena requesting 
to see all documents and 
communications related to a 
June study by federal scientists 
showing that global warming 
has continued unabated over 
the past 15 years. Lamar Smith 
(Republican, Texas), chair of 
the Committee on Science, 
Space, and Technology, issued 
the subpoena on 13 October 
after the National Oceanic and 
Atmospheric Administration 
refused to provide internal 
communications. The 
subpoena came to light 
publicly on 23 October after 
Democrat Bernice Johnson 
from Texas objected to the 
“jllegitimate harassment of our 
Nation’s research scientists”. 


Meltdown payout 
Japan’s Ministry of Health, 
Labour and Welfare has said 
that it will pay industrial- 
accident compensation toa 
member of the clean-up crew 
who worked on the damaged 
Fukushima Daiichi nuclear 
power plant and who has 


The news in brief 


Hurricane Patricia smashes records 


Hurricane Patricia broke a slew of meteorological 
records just before it pounded the west coast of 
Mexico on 23 October (pictured in an infrared 
satellite image). The category-5 storm intensified 
rapidly before making landfall, with wind 

speeds of almost 324 kilometres per hour and 

a central pressure of 87.9 kilopascals, making 


recently developed cancer. 
According to the Japanese 
media, the ministry said on 

20 October that the causal link 
between the worker's exposure 
and his leukaemia was 
“unclear”. The government 
could be exposed to many 
more claims. Thousands 

of workers took part in the 
clean-up operations after three 
reactors went into meltdown 
in March 2011, following an 
earthquake and a tsunami. 


POLICY 


Bonn climate talks 
A draft text for a global 
greenhouse-gas-reduction 
agreement was drawn up 

at talks in Bonn, Germany, 
between 19 and 23 October. 
During the talks, Sri Lanka 
and the United Arab Emirates 
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became the 155th and 156th 
countries to submit pledges for 
mandatory domestic action, 
which will be at the core of 
any United Nations climate 
agreement made in Paris 

this December. Meanwhile, 

in a briefing released on 

21 October, the International 
Energy Agency said that 
US$13.5 trillion need to be 
invested globally in energy- 
efficiency and low-carbon 
technologies over the next 

15 years to compensate for the 
planned reduction in fossil- 
fuel use. 


Marine reserve 
The Congress of Palau 
approved plans to create a 
500,000-square-kilometre 
marine reserve around 


the Pacific island nation 
on 22 October. The 


it the strongest hurricane ever recorded in 
the Western Hemisphere. Heavy rains raked 
the coast between the cities of Puerto Vallarta 
and Manzanillo, causing landslides, but the 
devastation was less than feared because the 
most intense winds and storm surge struck in 
relatively lightly populated areas. 


announcement came in the 
same week as an analysis 
showed that the amount of the 
world’s oceans that is ‘strongly 
protected, with some fishing 
allowed, or ‘fully protected’ has 
increased from 0.1% to 1.6% in 
the past decade (J. Lubchenco 
and K. Grorud-Colvert Science 
350, 382-383; 2015). The area 
that is afforded some level of 
protection is 3.5%, still well 
short of the internationally 
agreed target of 10% by 2020. 


Climate regulation 
Pakistani Prime Minister 
Nawaz Sharif endorsed 
proposals to regulate 
hydrofluorocarbons (HFCs) 
under the Montreal Protocol 
on Substances that Deplete 
the Ozone Layer. Commonly 
used as refrigerants, HFCs 
were created as alternatives to 
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chlorofluorocarbons, which 
damage the ozone layer — 
but HFCs are still powerful 
greenhouse gases. Sharif 
declared his support in a joint 
statement with US President 
Barack Obama on 22 October 
after a visit to the White 
House. At a meeting in Dubai 
on 1-5 November, parties 

to the Montreal Protocol 

will consider amendments 
intended to phase out HFCs. 


PEOPLE 
Science top job 


Australia’s new prime 
minister, Malcolm Turnbull, 
appointed engineer Alan 
Finkel as chief scientist on 

27 October. Finkel, who 
co-founded the popular- 
science magazine Cosmos 

and is currently chancellor of 
Monash University, is a vocal 
advocate of nuclear power 

as a carbon-neutral energy 
source, and of the need for 
policies to combat climate 
change. Media reports suggest 
that Finkel’s appointment 
may signal a policy shift in 

a country in which nuclear 
energy is banned and whose 
previous prime minister was 

a notorious climate-change 
sceptic. Finkel said in a press 
conference that he wanted to 
see the end of the use of coal, 
oil and gas in the country, with 
renewables and nuclear power 
being explored to achieve 
zero-emissions energy. 


TREND WATCH 


The number of US biomedical 


postdocs fell 5.5% between 2010 


and 2013, to just over 38,000, 
with losses getting bigger each 
year, according to an analysis 
(H. H. Garrison et al. EASEB J. 


http://doi.org/8m3; 2015). The 
main reason for the drop seems to 
be fewer positions for postdocs, 
but the time that individuals 
spend as postdocs is also falling. 
From 1979 to 2010, the number 
of US postdocs in the biomedical 
sciences rose steadily, from just 
over 10,000 to more than 40,000. 


See go.nature.com/fbkl5y for more. 


Lisa Jardine dies 


Historians of science, 
politicians and researchers 
paid tribute on 26 October 
after it was announced. 

that historian Lisa Jardine 
(pictured) had died. Jardine’s 
hugely varied career included 
work on eighteenth-century 
scientist Robert Hooke and 
the scientific revolution. She 
chaired Britain's Human 
Fertilisation and Embryology 
Authority between 2008 and 
2014, during a period when 
the agency led the way in 
regulating mitochondrial 
replacement and other cutting- 
edge scientific advances. 
Mitochondrial-replacement 
therapy was made legal in 
Britain in February 2015. 


| RESEARCH 
Mars lander 


Europe's first Mars rover, 
ExoMars, looks set to land 
on avast, clay-rich plain 
called Oxia Planum, the 


POSTDOCS IN DECLINE? 


European Space Agency (ESA) 
announced on 21 October. 
ExoMars, a joint endeavour 
between ESA and Russia's 
space agency Roscosmos, will 
land in 2019 and is designed 
to look for evidence of life. 
The craft’s instruments will 
include a drill capable of 
burrowing 2 metres into the 
Martian surface. Oxia Planum 
was picked from a shortlist 

of four potential landing sites 
(see Nature 508, 19-20; 2014); 
the agencies will confirm the 
choice six months before the 
mission’s 2018 launch. 


FACILITIES 


Telescope closed 
The United Kingdom Infrared 
Telescope (UKIRT) atop 
Mauna Kea in Hawaii will 

be permanently shuttered, 
making it the third closed 

to accommodate the next- 
generation Thirty Meter 
Telescope (TMT). The 
University of Hawaii took 
over the UKIRT — the most 
scientifically productive 
ground-based telescope 
worldwide — from the UK 
Science and Technology 
Facilities Council last 

year. The decision to close 
stems from a long-term 
decommissioning plan, which 
identifies the UKIRT as a site 
to be restored to its natural 
state. Hawaii’s governor has 
called for closure of one- 
quarter of Mauna Kea’s 13 


The number of researchers in US postdoctoral positions is starting 
to fall, particularly in the biomedical sciences — although the 


number of new PhDs is still rising. 
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SEVEN DAYS | THIS WEEK | 


1-4 NOVEMBER 
Researchers hoping to 
catch up on the latest 
in cancer genomics can 
do so at a European 
Molecular Biology 
Laboratory conference 
in Heidelberg, 
Germany. 
go.nature.com/p5uqwp 


1-4 NOVEMBER 
The Geological Society 
of America holds its 
annual meeting in 
Baltimore, Maryland. 
go.nature.com/ks35w9 


4-7 NOVEMBER 
Budapest hosts the 2015 
World Science Forum, 
themed ‘the enabling 
power of science. 
www.sciforum.hu 


telescopes by the time the 
TMT becomes operational 
in the 2020s (see Nature 526, 
24-28; 2015). 


| __BUSINESS 
23andMe revamp 


Genetic-testing company 
23andMe, based in Mountain 
View, California, launched 

a revamped consumer test 

on 21 October. The product 
offers information about a 
customer's genetic-carrier 
status for 36 diseases, marking 
the first time that 23andMe 
has been allowed to provide 
medically relevant results 
since the US Food and Drug 
Administration banned the 
inclusion of detailed predictive 
information in tests in 
November 2013. Previously, 
the product tested for 240 
health conditions. With a few 
exceptions, carrier-status tests 
do not say anything about the 
health of the individual and 
are mostly for rare diseases. 
See go.nature.com/5vicei and 
page 609 for more. 
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Trials of the world’s first vaccine against malaria in sub-Saharan Africa have shown that it offers only modest protection. 


AG 


BY EWEN CALLAWAY & AMY MAXMEN 


r | he world’s first vaccine against malaria 
should be rolled out in limited pilot 
demonstrations in Africa, an advisory 

group to the World Health Organization 

(WHO) in Geneva said on 23 October. 

The decision — which the WHO’s direc- 
tor-general is expected to formally endorse 
in November — follows 28 years of devel- 
opment by the London-based drug firm 
GlaxoSmithKline (GSK) and other backers 
including the Bill & Melinda Gates Founda- 
tion in Seattle, Washington; together they have 
spent US$565 million on the drug. 

The demonstrations, involving up to 
1 million children, are needed because the 
vaccine is ineffective against malaria unless 


children receive four doses spread out over 
18 months, and even then it offers only mod- 
est protection. 

“Tf this vaccine is not effective and we use it 
widely, we have spent a ton of money where it 
could be better placed,’ said Jon Abramson, a 
paediatric infectious-disease specialist at Wake 
Forest School of Medicine in Winston-Salem, 
North Carolina, and chair of the WHO Stra- 
tegic Advisory Group of Experts (SAGE) on 
Immunization, in a press briefing. 

The imperfections of the vaccine, called 
RTS,S and sold as Mosquirix, are well known: 
trials in more than 15,000 children, who were 
followed for up to four years in seven countries 
in sub-Saharan Africa, found that a series of 
four shots reduced the number of malaria cases 
by only 36% in young children, and by 26% in 


Vaccine gets cautious boost 


Malaria immunization endorsed for small-scale use in Africa. 


infants’. Still, even such a modest effect could 
be significant because malaria kills around half 
a million people annually, mostly children, and 
all other candidate vaccines are in much earlier 
stages of development. 

The advisory group recommended a series 
of 3-5 pilot demonstrations in areas with a 
medium to high incidence of malaria, involv- 
ing up to 1 million children in total. These will 
reveal whether parents bring their children 
back for all four doses of the vaccine, said 
Abramson. The pilots will also investigate 
safety issues associated with the vaccine, such 
as the potential to develop meningitis. 

The demonstrations could start in 2016 and 
are expected to last 3-5 years. Seth Berkley, 
head of Gavi, the Vaccine Alliance, in Geneva, 
Switzerland, says that his organization will 


29 OCTOBER 2015 | VOL 526 | NATURE | 617 


© 2015 Macmillan Publishers Limited. All rights reserved 


| NEWS IN FOCUS 


> soon decide whether it will help to pay for 
the pilot demonstrations. “It certainly is possi- 
ble that the board will say yes to this, but there's 
no guarantee,” he says. As data roll in, SAGE 
will review its position: a final decision on 
whether to recommend deploying the vaccine 
more widely could come during this period. 

Even if that happens, it is not clear what the 
uptake would be. Although African malaria- 
control officials welcome RTS,S, they say that 
they would need more funding to deploy the 
vaccine. Budgets for malaria prevention and 
treatment using measures such as insecticide- 
treated bed nets and artemisinin-combination 
therapies are already stretched thin. 

GSK says that it will charge $1-10 per shot, 
covering the company’s manufacturing costs 
anda return of 5%, to be reinvested in new vac- 
cines for malaria or other diseases that are com- 
mon in the developing world. But funding will 
also be needed to deliver the vaccine to children 
and for programmes to disseminate informa- 
tion. Parents must understand that their chil- 
dren can still get malaria even with the vaccine, 
says James Tibenderana, development director 
at the Malaria Consortium in Uganda. 


POOR MATCH 

SAGES decision to pilot the vaccine follows the 
publication of a study on 21 October, which 
revealed’ that the vaccine’s poor performance in 
clinical trials is in part because it mimics a strain 
of the malaria parasite Plasmodium falciparum 
that is not commonly found in Africa. 

The vaccine is composed partly of a frag- 
ment of circumsporozoite (CS) protein, which is 
found on the surface of the parasite. People who 
have been given RTS,S build up some immunity 
to malaria. But different parasites have slightly 
different CS proteins — and the study showed 
that fewer than 10% of parasites infecting some 
5,000 children in the trials matched the CS pro- 
tein in RTS,S. If the vaccine could be re-engi- 
neered to include bits of several surface proteins, 
it would be more effective, says Dyann Wirth, 
an infectious-disease researcher at the Harvard 
T. H. Chan School of Public Health in Boston, 
Massachusetts, who led the 21 October study. 

That redevelopment could take years, 
although some researchers have been discuss- 
ing the possibility, according to David Kaslow, 
who oversees the vaccine’s development at the 
non-profit health organization PATH. “It’s 
not trivial to tweak the vaccine to match the 
prevalent strains in an area,’ he says, “but it’s 
not impossible.” 

In the meantime, the advice to run demon- 
strations of RTS,S sends the right message, says 
Adrian Hill, a vaccinologist at the University 
of Oxford, UK. “What the field needs is other 
players to come forward and accelerate their 
more modern vaccine candidates.” = 


1. RTS,S Clinical Trials Partnership. Lancet 386, 
31-45 (2015). 

2. Neafsey, D.E. et al. N. Engl. J. Med. http://dx.doi. 
org/10.1056/NEJMoa1505819 (2015). 
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CLIMATE SCIENCE 


BY ALEXANDRA WITZE 


rilling through ice sheets is a tedious 
D= It takes years of fieldwork to 

retrieve long ice cores that keep a 
continuous record of the climate stretching 
back hundreds of thousands of years. 

Now there is a faster way to bore deep into 
Earth's history. Anxious to get to ice as old as 
1.5 million years, nearly double the age of the 
oldest existing core, climate researchers have 
developed a new generation of ‘rapid-access 
ice drills. Some of these rigs will face their 
first major tests during the Antarctic field 
season that begins this month. 

These speedy tools take roughly a week, 
rather than years, to penetrate several kilo- 
metres of ice. They blitz through the top- 
most layers of ice to reach the ancient freeze 
beneath, where tiny bubbles of trapped air 
serve as a time capsule of environments long 
vanished. 

One of the biggest and most ambitious 
machines, a US project known as the Rapid 
Access Ice Drill (RAID), is being shipped 


© 2015 Macmillan Publishers Limited. All rights reserved 


Ice cores allow scientists to analyse past precipitation and temperature changes. 


Super-fast drills 
hunt for oldest ice 


Researchers will test machines that can penetrate 
kilometres in days rather than years. 


in November from its construction site in 
Salt Lake City, Utah, to McMurdo Station in 
Antarctica (see ‘Climate clues’). The British 
Antarctic Survey will test a much smaller 
drill, also named RAID (for Rapid Access 
Isotope Drill) in December at the Sky Blu 
station on the Antarctic peninsula. French 
and Swiss research teams are developing their 
own fast drill designs. 

The drills sacrifice detail for speed, how- 
ever. They chip up or melt the ice as they go, 
so extracting an intact core is impossible. But 
these fast drills will be able to do quick sur- 
veys of places where researchers might return 
in future field seasons to extract a full ice core 
at a more leisurely pace. The US$10.5-million 
US RAID drill, for instance, is designed to 
plough through more than 3 kilometres of 
ice in about a week. That speed would allow 
it to hop around Antarctica and drill several 
exploratory holes per season — instead of 
one hole over several seasons. 

Even so, finding the planet’s most ancient 
ice will not be easy. “We're looking for a very 
fortuitous set of circumstances that allow for 
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The News story ‘Vaccine gets cautious 
boost’ (Nature 526, 617-618; 2015) 
incorrectly stated that David Kaslow was 


involved in the early development of RTS,S. 
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> soon decide whether it will help to pay for 
the pilot demonstrations. “It certainly is possi- 
ble that the board will say yes to this, but there's 
no guarantee,” he says. As data roll in, SAGE 
will review its position: a final decision on 
whether to recommend deploying the vaccine 
more widely could come during this period. 

Even if that happens, it is not clear what the 
uptake would be. Although African malaria- 
control officials welcome RTS,S, they say that 
they would need more funding to deploy the 
vaccine. Budgets for malaria prevention and 
treatment using measures such as insecticide- 
treated bed nets and artemisinin-combination 
therapies are already stretched thin. 

GSK says that it will charge $1-10 per shot, 
covering the company’s manufacturing costs 
anda return of 5%, to be reinvested in new vac- 
cines for malaria or other diseases that are com- 
mon in the developing world. But funding will 
also be needed to deliver the vaccine to children 
and for programmes to disseminate informa- 
tion. Parents must understand that their chil- 
dren can still get malaria even with the vaccine, 
says James Tibenderana, development director 
at the Malaria Consortium in Uganda. 


POOR MATCH 

SAGES decision to pilot the vaccine follows the 
publication of a study on 21 October, which 
revealed’ that the vaccine’s poor performance in 
clinical trials is in part because it mimics a strain 
of the malaria parasite Plasmodium falciparum 
that is not commonly found in Africa. 

The vaccine is composed partly of a frag- 
ment of circumsporozoite (CS) protein, which is 
found on the surface of the parasite. People who 
have been given RTS,S build up some immunity 
to malaria. But different parasites have slightly 
different CS proteins — and the study showed 
that fewer than 10% of parasites infecting some 
5,000 children in the trials matched the CS pro- 
tein in RTS,S. If the vaccine could be re-engi- 
neered to include bits of several surface proteins, 
it would be more effective, says Dyann Wirth, 
an infectious-disease researcher at the Harvard 
T. H. Chan School of Public Health in Boston, 
Massachusetts, who led the 21 October study. 

That redevelopment could take years, 
although some researchers have been discuss- 
ing the possibility, according to David Kaslow, 
who oversees the vaccine’s development at the 
non-profit health organization PATH. “It’s 
not trivial to tweak the vaccine to match the 
prevalent strains in an area,’ he says, “but it’s 
not impossible.” 

In the meantime, the advice to run demon- 
strations of RTS,S sends the right message, says 
Adrian Hill, a vaccinologist at the University 
of Oxford, UK. “What the field needs is other 
players to come forward and accelerate their 
more modern vaccine candidates.” = 


1. RTS,S Clinical Trials Partnership. Lancet 386, 
31-45 (2015). 

2. Neafsey, D.E. et al. N. Engl. J. Med. http://dx.doi. 
org/10.1056/NEJMoa1505819 (2015). 
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BY ALEXANDRA WITZE 


rilling through ice sheets is a tedious 
D= It takes years of fieldwork to 

retrieve long ice cores that keep a 
continuous record of the climate stretching 
back hundreds of thousands of years. 

Now there is a faster way to bore deep into 
Earth's history. Anxious to get to ice as old as 
1.5 million years, nearly double the age of the 
oldest existing core, climate researchers have 
developed a new generation of ‘rapid-access 
ice drills. Some of these rigs will face their 
first major tests during the Antarctic field 
season that begins this month. 

These speedy tools take roughly a week, 
rather than years, to penetrate several kilo- 
metres of ice. They blitz through the top- 
most layers of ice to reach the ancient freeze 
beneath, where tiny bubbles of trapped air 
serve as a time capsule of environments long 
vanished. 

One of the biggest and most ambitious 
machines, a US project known as the Rapid 
Access Ice Drill (RAID), is being shipped 
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Ice cores allow scientists to analyse past precipitation and temperature changes. 


Super-fast drills 
hunt for oldest ice 


Researchers will test machines that can penetrate 
kilometres in days rather than years. 


in November from its construction site in 
Salt Lake City, Utah, to McMurdo Station in 
Antarctica (see ‘Climate clues’). The British 
Antarctic Survey will test a much smaller 
drill, also named RAID (for Rapid Access 
Isotope Drill) in December at the Sky Blu 
station on the Antarctic peninsula. French 
and Swiss research teams are developing their 
own fast drill designs. 

The drills sacrifice detail for speed, how- 
ever. They chip up or melt the ice as they go, 
so extracting an intact core is impossible. But 
these fast drills will be able to do quick sur- 
veys of places where researchers might return 
in future field seasons to extract a full ice core 
at a more leisurely pace. The US$10.5-million 
US RAID drill, for instance, is designed to 
plough through more than 3 kilometres of 
ice in about a week. That speed would allow 
it to hop around Antarctica and drill several 
exploratory holes per season — instead of 
one hole over several seasons. 

Even so, finding the planet’s most ancient 
ice will not be easy. “We're looking for a very 
fortuitous set of circumstances that allow for 
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DEEP FREEZE 


Antarctica may harbour ice up to 1.5 million 
years old that could help scientists to understand 
Earth’s past climate. 
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the preservation of very old ice,’ says Jeffrey 
Severinghaus, a palaeoclimatologist at the 
Scripps Institution of Oceanography in La 
Jolla, California. Ideally, scientists would 
discover a thick sequence of ice layers, undis- 
turbed by flowing glaciers, that has not been 
heated too much by the rock below. Possible 
locations include several of the high-elevation 
Antarctic ice domes, such as Dome A near to 
which Chinas Kunlun research station sits, or 
Dome C, where European researchers took 
five years to extract a core that reached into 
800,000-year-old ice layers (see ‘Deep freeze’). 

Now researchers want to push even further, 
to ice that is at least 1.2 million years old. That 
would provide data on an important shift in 
Earth's climate, when the planet’s glacial cycles 
changed from being dominated by a 100,000- 
year pattern to 41,000-year cycles (H. Fischer 
et al. Clim. Past 9, 2489-2505; 2013). 

Knowing what controlled that switch — and 
whether rising carbon dioxide levels played a 
part, along with factors such as changes in 
Earth's rotational tilt — would help scientists 
to better understand how ice sheets will behave 
as the world warms. “If we don’t understand 
this we really don’t understand the climate that 
we have today,’ says Severinghaus. 
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The US and UK drills take different 
approaches to reach deep into Antarctica’s 
past. Once the US RAID reaches the bottom 
of the ice sheet, it could drill up to 50 metres 
into the underlying rock. Analysing that rock 
could reveal when it was last exposed to cos- 
mic rays — which, in turn, reveals the age 
of the overlying section of the Antarctic ice 
sheet. The first full-scale field trial for RAID 
is scheduled for 2016-17. 

The British RAID is a much more 
modest project that costs less than £500,000 
(US$770,000) and uses a modified conven- 
tional ice-core drill. It will be able to penetrate 
only about 600 metres into the ice sheet, to ice 
that is 30,000 to 40,000 years old — but unlike 
the US RAID, it does not require drilling fluid, 
the weight of which adds considerably to the 
cost of moving a drill around. “You cant dry- 
drill deeper than that,’ says Julius Rix, an engi- 
neer who is leading the drill’s development. 
“But there are plenty of places of interest.” 

A third drill, on a similar scale to the 
US machine, is the SUBGLACIOR probe 
being developed at Joseph Fourier University 
in Grenoble, France. This €3.2-million (US$5.3- 
million) project aims to melt rather than chip its 
way through the ice sheet, measuring chemical 
isotopes of the melted water as it goes, to calcu- 
late the age of the ice. The drill would be able to 
penetrate several kilometres deep; full testing 
is slated for 2016-17 at the Concordia research 
station in Antarctica, says Olivier Alemany, a 
polar engineer at Joseph Fourier University. 

A fourth project, dubbed RADIX, would use 
a much narrower hole than the others — just 
2 centimetres across — to bore up to 3 kilo- 
metres in a few days. RADIX has gone through 
limited testing in Greenland, says team leader 
Jakob Schwander, a climate scientist at the 
University of Bern. 

No one knows exactly what these drills 
might encounter when they hit bottom. They 
might penetrate into pristine lakes below the 
ice sheet, which microbiologists could explore. 
Or they might reveal heat radiating upwards 
from the bedrock, melting the ice in ways that 
scientists had not expected. 

“It’s very multidisciplinary, says John 
Goodge, a geologist at the University of Min- 
nesota Duluth and a project leader on US RAID. 
“There’s all kinds of stuff this fast technology 
allows that we were never able to have before.” = 
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CLIMATE CLUES 


The US-backed Rapid Access Ice 
Drill (RAID) seeks to penetrate into 
Antarctic ice faster than ever before. 
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ICE BOREHOLE SECTION 

* Rotating ice bit creates a 
8.9-centimetre-wide borehole. 

«Ice is chipped rather than 
cored. 


Ice 


iamond bit drills into rock. 
* Geological samples up to 
50 metres long retrieved. 
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US grants 
trapped in 
vicious circle 


Astronomers’ resubmissions 
drive plunge in success rates. 


BY CHRIS CESARE 


stronomers and astrophysicists in the 
Awe States are seeing their grant 

applications rejected at increasing rates 
because of stagnant budgets and an uptick in 
the number of resubmitted proposals, accord- 
ing to a draft report written for an advisory 
committee to the US National Science Foun- 
dation (NSF). The document, posted on the 
arXiv preprint server on 4 October, comes 
ahead of a November meeting set to discuss 
the issue (P. Cushman et al. Preprint at http:// 
arxiv.org/abs/1510.01647; 2015). 

The report highlights more than a decade 
of falling success rates for astronomical- 
science grants at the NSF and NASA as the 
number of proposals has increased faster than 
agency budgets. One key NSF programme 
in astronomy and astrophysics, for instance, 
funded fewer than 20% of proposals in 2014 
— down from nearly 40% in 2002. And some 
NASA programmes saw rates fall from around 
30% to 18% between 2004 and 2015. 

The report rules out many explanations that 
scientists have suggested for the drop, such as a 
decrease in the quality of proposals; data from 
NASA show that, among the astrophysics grant 
proposals submitted to the agency, the fraction 
receiving scores from ‘very good to ‘excellent’ 
remained roughly constant from 2007 to 2012. 

Instead, the report concludes, the main prob- 
lem is that whereas funding has stayed flat, the 
total number of astronomers has continued to 
grow — and so the rate of resubmitted proposals 
has risen even faster because investigators who 
fail to secure funding in one year often try again 
the next. These resubmissions now account for a 
disproportionate number of grant applications, 
compounding the problem and leading to the 
dramatic drop in success rates. 

The report enumerates the “knobs” that agen- 
cies can adjust to improve success rates, such as 
reducing the size of the average grant or shifting 
money from facilities to investigators — an idea 
that deserves a closer look, says Keivan Stassun, 
an astronomer at Vanderbilt University in Nash- 
ville, Tennessee, and a co-author of the analysis. 

Other fixes, such as capping the number of 
proposals from investigators who have submit- 
ted too many unsuccessful applications, only 
“disguise the problem’, the report argues. m 
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BY HELEN PEARSON 


n ambitious study that planned to 
Az information on 80,000 British 

babies throughout their lives has 
ended just 8 months after its official launch 
because not enough prospective parents 
signed up. The closure comes less than 
a year after the US National Institutes of 
Health (NIH) cancelled a similar effort to 
trace 100,000 children from birth, prompt- 
ing fears that researchers will now shy away 
from proposing similar studies. 

“Tam afraid that the scientific community 
may not dare to embark on similarly ambi- 
tious cohort studies in the near future,” 
says Camilla Stoltenberg, who heads the 
Norwegian Institute of Public Health in Oslo. 
She is responsible for a major birth-cohort 
study in Norway and chaired the inter- 
national scientific-advisory committee to 
the UK project, called Life Study. 
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Life Study aimed to find associations between factors early in life and outcomes later on. 


Massive UK baby 
study cancelled 


After demise of similar US project, decision prompts 
rethink about design of future cradle-to- grave efforts. 


Prized by both medical researchers and 
social scientists, birth-cohort studies reveal 
associations between factors early in life, 
such as poverty or a mother’s diet in preg- 
nancy, and outcomes later on, ranging from 
diseases to cognition and earnings. Various 
efforts already exist around the world, but 
Life Study was to be one of the biggest and 
most ambitious yet. It got the green light 
in 2011 when government funding bodies, 
including the Economic and Social Research 
Council (ESRC) and the Medical Research 
Council, agreed to support the study with 
£38.4 million (US$58.9 million) until 2019. 

In January 2015, a team led by Carol 
Dezateux, a paediatric epidemiologist at 
University College London’s Institute of 
Child Health, opened the study’s first dedi- 
cated recruitment centre, on the outskirts of 
London. The researchers hoped to sign up as 
many as 16,000 prospective mothers — ofa 
total target of 60,000 — by July 2016. Another 


MASTERFILE/CORBIS 


20,000 babies were to be recruited nationwide 
after birth. But between January and early 
September this year, just 249 women signed 
up, according to the ESRC, which oversaw the 
study. A review of the project in July identified 
recruitment as a major concern, andon 10 July, 
the ESRC decided that the study should close. 
The cancellation was publicly announced on 
22 October. 


PREMATURE DEMISE? 

Dezateux and some of her colleagues say that 
the closure was premature, and that they were 
not sufficiently consulted on the decision. 
They accept that recruitment was difficult, a 
challenge intensified by the study’s remit to 
include a substantial proportion of families 
from ethnic-minority and disadvantaged 
groups, who have historically been particularly 
hard to recruit. 

But the researchers say that they intended to 
test and refine recruitment methods during the 
first phase of the study — for example, the team 
had planned to make the study less burdensome 
for women by collecting information during a 
routine ultrasound scan rather than asking fora 
separate visit — and that the review process did 
not take such plans fully into account. 

Fiona Armstrong, who was responsible for 
Life Study at the ESRC, says that the research 
council did indeed consider the researchers’ 
plans to adjust the recruitment process — and 
consulted the research team as part of that 


process — but ultimately, it still concluded that 
“whatever might be done wasn't enough”. “We 
couldn't take the risk of putting more and more 
money into it,” she says. The study consumed 
around £9 million ($13.8 million), a sliver of 
the more than $1.2 billion — over 15 years — 
that was sunk into the US National Children’s 
Study (NCS). 

Epidemiologists are drawing parallels 
between Life Study’s demise and that of the 
NGS. “It’s déja vu all over again,” says Mark Kle- 
banoff, a paediatric epidemiologist at Nation- 
wide Children’s Hospital in Columbus, Ohio. 

Clinical epidemiologist George Davey 
Smith, who co-directs a separate birth-cohort 
study at the University of Bristol, UK, notes 
that a huge challenge for both efforts was 
that they were trying to provide answers 
to extremely diverse questions, which put 

constraints on the 


iT3 4 

We have tofi ind studies’ designs. For 
ways of doing example, assessing 
this that pose inequalities between 
theleast burden socio-economic 
possible to groups requires data 


participants.” from a large, repre- 
sentative popula- 
tion sample that includes disadvantaged and 
minority groups — whereas answering ques- 
tions relating to the origin of disease requires 
the collection of extensive biological samples 
such as blood and tissues. “It’s incredibly sad,” 


he adds of Life Study’s end. 
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Those involved in both studies hope to 
salvage something from the wreckage. After 
the NCS ended, plans emerged for a more 
modest study of influences on child health; 
Dezateux says that she and her colleagues are 
“determined to take forward key elements” of 
Life Study. 

Whether and how such studies can be 
conducted in future is unclear. Response rates 
are falling in many surveys and population 
studies compared with those in decades past, 
say researchers — perhaps because there are 
more demands on people’s attention. “We have 
to be mindful of the fact that people's lives are 
busier than ever,’ says Klebanoff. “We have 
to find ways of doing this that pose the least 
burden possible to participants.” 

Scientists need to exploit existing data 
sources more, says Stoltenberg. Extensive 
databases of health, educational and income 
data exist in many countries and provide vast 
amounts of information on the cheap — as 
long as people consent to their use. In Norway, 
such databases have been crucial to the success 
of its national birth-cohort study, which is fol- 
lowing more than 100,000 children, she says. 

But it is important to create systems through 
which information can be more easily extracted 
from such databases for use in cohort and other 
types of research, she adds. “We don't have the 
infrastructure,’ she says. “We're trying to drive 
sophisticated vehicles like birth-cohort studies 
where there are no real roads.” = 
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PLANETARY SCIENCE 


Falling junk has scientific value 


Astronomers prepare to observe an impact off Sri Lanka. 


BY TRACI WATSON 


esearchers call it sheer coincidence 
Re a newly discovered piece of 

space junk is officially designated 
WT1190F. But the letters in the name, 
which form the acronym for an unprintable 
expression of bafflement, are an appropriate 
fit for an object that is as mysterious as it is 
unprecedented. 

Scientists have worked out that WT1190F 
will plunge to Earth from above the Indian 
Ocean on 13 November, making it one of 
the very few space objects whose impact can 
be accurately predicted. More unusual still, 
WT1190F was a ‘lost’ piece of space debris 
orbiting far beyond the Moon, ignored and 
unidentified, before being glimpsed by a 
telescope in early October. 

An observing campaign is now taking 
shape to follow the object as it dives through 
Earth’s atmosphere, says Gerhard Drolshagen, 


nter the 
13 November. 


Bay of 
Bengal 


Indian 
Ocean 


Projected 
impact point 


co-manager in Noordwijk, the Netherlands, 
of the European Space Agency’s near-Earth 
objects office. The event not only offers a 
scientific opportunity to watch an object 
plunge through the atmosphere, but also tests 


The object WT1190F, discovered in early October, 
appears as a blurry speck in an image taken by 
the University of Hawaii’s 2.2-metre telescope. 


the plans that astronomers have put in place 
to coordinate their efforts when a potentially 
dangerous space object shows up. “What we 
planned to do seems to work,’ Drolshagen says. 
“But it’s still three weeks to go.” 
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> WT1190F was detected by the 
Catalina Sky Survey, a programme based at the 
University of Arizona, Tucson, aimed at 
discovering asteroids and comets that swing 
close to Earth. At first, scientists didn't know 
what to make of this weird body. But they 
quickly computed its trajectory after collecting 
further observations and unearthing 2012 and 
2013 sightings from telescope archives, says 
independent astronomy-software developer 
Bill Gray, who has been tracking the debris 
with astronomers at NASA’s Jet Propulsion 
Laboratory in Pasadena, California. 

WT1190F travels in a highly elliptical orbit, 
swinging out twice as far as the Earth-Moon 
distance, Gray says. His calculations show 
that it will hit Earth at 06:20 utc, entering the 
ocean about 65 kilometres off the southern tip 
of Sri Lanka (see ‘Splashdown’). Much, if not 
all, of it will burn up in the atmosphere, but “I 
would not necessarily want to be going fishing 
directly underneath it’, Gray says. 

The object is only 1 to 2 metres in size, 
and its trajectory shows that it has a low 
density, and is perhaps hollow. That sug- 
gests an artificial object — “a lost piece of 
space history that’s come back to haunt us,” 
says Jonathan McDowell, an astrophysicist at 
the Harvard-Smithsonian Center for Astro- 
physics in Cambridge, Massachusetts. It could 
bea spent rocket stage or panelling shed bya 
recent Moon mission. It is also possible that 
the debris dates back decades, perhaps even 
to the Apollo era. An object seen orbiting 
Earth in 2002 was eventually identified as a 
discarded segment of the Saturn V rocket that 
launched the second mission to put humans 
on the Moon. 

WT1190F is a rare breed of space object. 
Researchers are currently tracking only 20 or 
so artificial objects in distant orbits, says 
Gareth Williams, an astronomer at the Minor 
Planet Center in Cambridge, Massachusetts. 
There are probably many more such pieces 
of space junk in orbit around the Earth- 
Moon system, but it is impossible to say how 
many. No others are known to have made the 
return trip to Earth, although it is likely that 
some have done so without anyone noticing, 
McDowell says. 

Drolshagen plans to get spectral informa- 
tion on the object, which may help to identify 
it, and he hopes to coordinate impact obser- 
vations conducted on ships or aeroplanes. 
But that may be the end of the concerted 
effort to study this class of object. Unlike 
near-Earth asteroids, space debris that flies 
well away from Earth has not commanded 
significant amounts of funding or atten- 
tion. The US military, which tracks space 
debris, says that it lacks the ability to identify 
WT1190F or to predict its path. 

“There is no official, funded effort to do 
tracking of deep-Earth orbits the way we track 
low-Earth orbit,” McDowell says. “I think that 
has to change”. = 
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Cancer-fighting 
viruses near market 


Anticipated approval in Europe and the United States 
could spur a promising field with a chequered past. 


BY HEIDI LEDFORD 


n engineered herpesvirus that 
Avent an immune reponse against 

cancer seems poised to become 
the first treatment of its kind approved 
for use in Europe and the United States. 
On 23 October, advisers to the European 
Medicines Agency endorsed the approval 
of a genetically engineered virus called 
talimogene laherparepvec (‘T- VEC) to treat 
advanced melanoma. In April, advisers to the 
US Food and Drug Administration (FDA) 
did the same, and the agency is expected to 
approve T-VEC this month. 

With dozens of ongoing clinical trials 
of similar ‘oncolytic’ viruses, researchers 
hope that such an approval could generate 
the enthusiasm and cash needed to spur 
further devel- 


opment of the Viral infection 
approach. “The Canravagea 

era of the onco- tumour while’ 
lyticvirusisprob- leaving abutting 
ably here,” says healthy cells 
Stephen Russell,a untouched. 


cancer researcher 
and haematologist at the Mayo Clinic in 
Rochester, Minnesota. “I expect to see a great 
deal happening over the next few years.” 
Many viruses preferentially infect cancer 
cells. Malignancy can suppress normal 
antiviral responses, and sometimes the 
mutations that drive tumour growth also 
make cells more susceptible to infec- 
tion. Viral infection can thus ravage a 
tumour while leaving abutting healthy cells 
untouched, says Brad Thompson, president 
of the pharmaceutical-development firm 
Oncolytics Biotech in Calgary, Canada. 


EARLY ATTEMPTS 

The strategy builds on a phenomenon that 
has been appreciated for more than a cen- 
tury. Physicians in the 1800s noted that their 
cancer patients sometimes unexpectedly 
went into remission after experiencing a viral 
infection. These case reports later inspired 
doctors, particularly in the 1950s and 
1960s, to raid nature’s viral cupboard. 
Clinicians injected cancer patients with a 
menagerie of viruses. Sometimes the therapy 
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destroyed the tumour, and sometimes it 
killed the person instead. 

Unlike the wild viruses used in those 
mid-twentieth-century experiments, some 
of today’s anti-cancer viruses are painstak- 
ingly engineered. T-VEC, for example, has 
been altered to drastically reduce its ability 
to cause herpes. Researchers also inserted a 
gene encoding a protein that stimulates the 
immune system, which makes the virus even 
more potent against cancer (see ‘Going viral 
against cancer’). 

As more researchers entered the field and 
initiated small clinical tests, they began to 
produce enticing anecdotes. Russell recalls 
the case of an individual with myeloma who 
remained sick after undergoing two stem-cell 
transplants. A tumour on the left side of her 
forehead had degraded the bone underneath 
and was putting pressure on her brain. Yet 
treatment with an experimental virus sent 
her into complete remission (S. Russell et al. 
Mayo Clin. Proc. 89, 926-933; 2014). “She's a 
star patient who convinced us that this onco- 
lytic paradigm can really work,” he says. 

But statistics — not anecdotes — rule over 
drug approvals. In 2005, regulators in China 
approved an oncolytic adenovirus called 
H101 to treat head-and-neck cancer, after 
evidence showed that the treatment could 
shrink tumours. Those trials stopped short of 
assessing improvements in patient survival — 
a measure often required for FDA approval. 
Since then, a medical-tourism industry has 
built up in China for people who cannot get 
the therapy in their home countries. 

Then, in May this year, a team supported 
by biotechnology giant Amgen of Thousand 
Oaks, California, published promising 
results from a large clinical trial of T- VEC 
(R. H. Andtbacka et al. J. Clin. Oncol. 
33, 2780-2788; 2015). The virus both shrank 
tumours in people with advanced melanoma 
and extended patient survival by a median of 
4.4 months. Yet statistically, survival benefits 
fell just a hair's breadth of significance. “That 
raised the question, ‘Well, what is statistical 
significance? Is this an active agent or not?” 
Russell says. 

He and others note that the therapy — 
which must be injected directly into tumours 
— seemed to rein in cancer elsewhere in the 


GOING VIRAL AGAINST CANCER 


The virus-based cancer therapy T-VEC infects tumour cells and destroys 
them by stimulating the immune system to direct an attack against 


malignant cells in the body. 


Healthy cell Cancer cell 
T-VEC virus 
T-VEC enters but cannot 
replicate in normal cells. 


body as well. This is a sign that results are 
real and that the virus sparked an immune 
response as intended, Thompson says. 


ROOM FOR IMPROVEMENT 
Administering T-VEC in combination with 
cancer immunotherapy could prove par- 
ticularly effective, notes Stephen Hodi, an 
oncologist at the Dana-Farber Cancer Institute 
in Boston, Massachusetts. In June 2014, a small 
clinical trial by Amgen suggested that this 
combination may boost effectiveness over that 
of the immunotherapies alone. 

And researchers continue to look for ways 


T-VEC destroys malignant cells directly, releasing the protein 
GM-CSF and antigens that enable the immune system to 
target cancerous cells nearby and throughout the body. 


Antigen 


GM-CSF 


to improve T-VEC. In particular, they would 
like to be able to deliver the therapy systemi- 
cally, so that the virus could target tumours in 
organs that are difficult to reach with an injec- 
tion. This would require a technique to prevent 
the body from mounting an immune response 
to the virus prematurely, which would disable 
it before it could reach and kill tumour cells, 
says Howard Kaufman, a cancer researcher at 
Rutgers Cancer Institute of New Jersey. 

To that end, those in the field are experi- 
menting with a smorgasbord of viruses — from 
poxviruses to vesicular stomatitis virus, which 
does not normally infect humans but causes a 
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Dendritic cell 


GM-CSF attracts dendritic cells, which present tumour 
antigens to the immune system’s T-cells, programming 
them to destroy cancer cells throughout the body. 


blistering disease in cattle. Oncolytics Biotech 
is studying a virus that hitch-hikes through the 
body on certain blood cells, camouflaged from 
the immune system. 

If cancer-killing viruses could be delivered 
to their targets through the bloodstream, 
rather than via injection directly into the 
tumour, they could be used to treat a greater 
range of cancers. Thompson envisions a day 
when physicians will be able to peruse a menu 
of oncolytic viruses and select the best fit. 
“Each virus interacts with the immune system 
differently,’ he says. “They could havea role in 
pretty much all cancer therapy.” = 
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Residents of Kiribati make their way home through floodwaters. Rain and severe weather can exacerbate the high tides that often rush into villages. 


Before we drown 


we may die of thirst 


The island nation of Kiribati 
is one of the world’s most 
vulnerable to rising sea 
levels. But residents may 
have to leave well before the 
ocean claims their homes. 


BY KENNETH R. WEISS 
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igh tide left its mark on the houses like a dirty ring in a bath- 
tub. The flood crept into the village of Teaoraereke under the 
cover of darkness, sending filthy seawater sloshing through 
pigsties and shallow graves, and into people's homes. 

Teaoraereke residents scrambled to retreat, hoisting sleeping chil- 
dren, sodden bedding and other belongings to higher ground. But some 
stayed put, including Rerema Kauria, a 63-year-old grandmother who 
was marooned just inches above the floodwaters on a raised platform 
bed. She was still there by mid-morning as the water receded, her pos- 
sessions tucked into the rafters of her traditional house of wooden poles 
and thatch. She knew that when high tide returned that afternoon it 
would bring more flooding, but she gave a roaring laugh when asked if 
she had considered leaving. “Where would I go?” 

The uncertain future of people such as Kauria has drawn attention 
to a collection of atolls in the central Pacific Ocean that make up the 
Republic of Kiribati (pronounced Keer-re-bahs). The average height 
of the country’s 33 islands is little more than 2 metres above the ocean, 
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which makes Kiribati acutely vulnerable to climate change. By the end 
of the century, melting polar ice and the thermal expansion of warmer 
seawater is expected to raise global ocean levels by perhaps 1 metre. 
That upsurge would, according to some predictions, displace many from 
Kiribati and millions of others around the world — and the water will 
keep going up. 

For years, Kiribati President Anote Tong has sounded the alarm over 
his nation’s plight, warning that residents would soon have to abandon 
their homeland. The flooding that hit Teaoraereke last year reinforces 
those dire predictions. Although it is impossible to know how much, if 
at all, climate change contributed to the flooding, village residents say 
that they have never before seen such inundation. To some of them, it 
seemed as if the swelling seas were starting to consume Kiribati and the 
end of the atoll might come sooner than they had thought. 

But researchers who study Kiribati say that the situation is not a simple 
story of rising seas swallowing low-lying islands. In fact, some coastal 
experts dispute the idea that Kiribati will soon sink beneath the waves 
like a modern Atlantis. They have gathered evidence that many of these 
islands have been gaining ground in recent decades by capturing sedi- 
ments from surrounding coral reefs. “It's just plain wrong to assume that 
all atolls are washing away,’ says Arthur Webb, a coastal geomorphologist 
affiliated with the University of Wollongong in Australia who has spent 
two decades living and working in the Pacific Islands. “It’s also wrong 
to sugar-coat the sobering facts that rising sea levels will ultimately seal 
the fate of low-lying islands and their limited soils and groundwater. The 
confusion isnt surprising. It’s just more complicated than many expect.” 

Even if Kiribati does not drown in the near future, its residents may 
soon need an exit strategy. Poverty, overcrowding and poor sanitation 
are galloping ahead of rising seas to deplete the islands resources, espe- 
cially their supply of clean fresh water. And residents’ habits of altering 
the shoreline and removing coastal protections can magnify the impacts 
of the swelling oceans, leaving villages more exposed to flooding. 

The story playing out on these tiny islands shows how difficult it is to 
tease out the impact of climate change from other human and environ- 
mental pressures. And what happens to the Kiribati people has implica- 
tions for the hundreds of millions in low-lying coastal areas across the 
globe who will be threatened with flooding 
and displacement in coming decades. But 
unlike the residents of Miami, Guangzhou 
or Mumbai, the Kiribati people have no 
option of retreating inland or up-slope as 
their vulnerable flyspecks of land become 
uninhabitable. As Kauria says: where would 
they go? 


AT THE MERCY OF THE TIDES 

From the air, Kiribati’s Tarawa atoll emerges 

from the Pacific as narrow strands of land that join to form a wispy V 
shape. On the outside of the V is the deep blue of the ocean; inside are 
the aquamarine and turquoise waters of the shallow lagoon. 

Tarawa is the capital of Kiribati, which is one of the most remote 
countries on Earth, located on the equator about halfway between Aus- 
tralia and Hawaii. Its atolls are scattered across a patch of the Pacific the 
size of India, and yet they have a total of just 811 square kilometres of 
land, about half the size of Greater London. 

When a plane lands on Tarawa, a crowd gathers at the airport, drawn 
by the excitement of the jet making the three-hour flight from Fiji. Aside 
from occasional freighters bringing canned food, this twice-weekly Fiji 
Airways flight provides the primary connection to the outside world. 

The airport was built on relatively high ground, an elevation of 3 
metres, in one of the atoll’s widest sections. It happens to sit above the 
main subsurface reservoir, a freshwater layer floating on top of the sea- 
water that presses against the porous island from all sides. 

Although the sea presents an existential threat, the more immediate 
problem is not too much water, but too little — of the fresh, clean kind. 
A dozen of Kiribati’s islands are deserted, too arid to support human 


“SOME PEOPLE USE CALENDARS 
TO GET THROUGH THE WEEK. 
WE USE TIDE CHARTS.” 
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habitation. Without enough replenishing rains, their thin lenses of 
groundwater turned brackish. On Tarawa, groundwater is heavily over- 
drawn and contaminated by the local practice of defecating on the beach 
or in the bushes. With little land, residents bury their relatives and raise 
pigs next to their homes, which also contributes to groundwater pollution. 

Next to the road leading to the airport, a buried white plastic pipe 
that carries fresh water from the reservoir has been exposed in places, 
owing to erosion by waves and tides. Public workers have fought back 
by using old tyres filled with concrete to hold it in place. They have 
had less success keeping locals from tapping illegally into the waterline, 
directing the flow into hand-dug wells for their homes. Water supplies 
are so limited that authorities turn on the airport's groundwater pumps 
for only a couple of hours every other day. 

The perils of water, both sweet and salty, are intertwined with Tarawa’s 
history, says George Fraser, high commissioner to Kiribati from Aus- 
tralia, which is the biggest provider of international support to the devel- 
oping nation. In one of the bloodier clashes in the Pacific during the 
Second World War, US commanders misjudged the tides and landing 
craft got stuck on the reef, forcing marines to wade through chest-deep 
water under heavy Japanese fire. 

Fraser deconstructs that infamous battle as he takes a tour of the 
island the day before the king tide that flooded Teaoraereke. A fast and 
confident driver, Fraser weaves his small sport utility vehicle around 
wobbly wheeled trucks, dodging potholes. In narrow spots, the atoll’s 
main road is soaked with seawater and he swerves to avoid a wave 
splashing over a concrete berm. “Some people use calendars to get 
through the week,” he says. “We use tide charts.” 

The road is the only paved one on the atoll. It crosses a series of cause- 
ways that have been battered by wheels and waves; road crews repair 
cavities, stuffing them with as much concrete and sand as possible to 
slow the decay. The Australian government has bankrolled much of a 
repaving project along the length of South Tarawa. 

As Australia’s top representative, Fraser has a keen sense of the various 
challenges that this poor country faces in coming decades, and how they 
stack up. “Ifyou look at rising sea levels as the train coming down the 
track, it’s a couple of kilometres away,” he says. “If you look at what's 100 
metres down the track, it’s no water, and 
right behind it is no food.” 

More than half of Kiribati’s 110,000 resi- 
dents live on Tarawa, and their numbers 
are rapidly increasing as more arrive from 
outer islands seeking jobs, cash and better 
schools. Many were subsistence fishers and 
farmers on their home islands, struggling 
with depleted fisheries and poor soil dam- 
aged by periodic over-wash of salt water. 
When they get to Tarawa, they often end up 
jobless or underemployed. 

The Kiribati culture is communal, with families accustomed to bed- 
ding down together on woven mats on the floor. It is taboo to refuse the 
request of a relative, so households often pack dozens of extended fam- 
ily members from other islands under one roof. That has made South 
Tarawa one of the most densely packed places in the Pacific; its clusters 
of shanties resemble slums in the poorest capitals of Africa and Asia. 
Factoring in high birth rates and ongoing urbanization, the government 
projects that the population of the island will almost double in 15 years. 
The new Battle of Tarawa will be over where all these people will live. 


THE INCREDIBLE SHRINKING ISLAND 

Today’s scientific debate about whether Kiribati is growing or shrinking 
can be traced to Charles Darwin — who first worked out how coral atolls 
form. While sailing the Pacific on the HMS Beagle in the 1830s, he theo- 
rized that these curiously shaped sand islands are produced by coral reefs 
that sprouted on the slopes of volcanic islands and have continued to grow 
as the volcanoes sink into the abyss. He was proved right a century later, 
when scientists drilled into an atoll and hit volcanic rock. 
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Over the millennia, the exoskeletons of millions of tiny coral animals 
fuse with coralline algae and the shells of molluscs and other sea crea- 
tures to form limestone reefs, often arranged in a circle with a shallow 
lagoon in the middle. Living corals grow on the fringes of these lime- 
stone platforms. As the crest of the living reef reaches close to the ocean 
surface, waves break some of it into rubble and sand that gets deposited 
on the dead limestone platform to form land. 

The atolls that exist today are the survivors, ones in which coral reefs 
kept pace with rising seas and the subsidence of the undersea volcano. The 
pressing issue is, what will become of those atolls as sea levels start rising 
faster? Researchers wonder whether corals can keep up, given the host of 
environmental problems they face. In many places, overfishing and nutri- 
ent pollution have triggered the growth of coral-killing bacteria and algae. 
Abnormally warm seawater is causing ‘bleaching’ die-offs throughout the 
tropics, and as ocean water takes up more carbon dioxide and acidifies, it 
will be harder for coral polyps to build rugged exoskeletons. 

Around Tarawa, the coral reefs are in particularly poor shape, says 
Simon Donner, a climatologist at the University of British Columbia in 
Vancouver, Canada, who has done diving surveys. “Coral cover is lower 
than you'd expect around the island,’ he says. “That's the legacy of pollu- 
tion, sewage mostly, and frequent bleaching events in the past 20 years.” 

To help predict how corals may fare in the future, Dennis Hubbard, a 
geologist at Oberlin college in Ohio, and his colleagues have been peering 
into the past, amassing a database of sediment core samples obtained by 
drilling into limestone beneath coral reefs. With carbon dating, they can 
determine how quickly these reefs have grown: in yet-to-be-published 
work, they have found that more than half of the world’s coral reefs grew 
more slowly over the past 10,000 years than sea levels are rising today. 
Extrapolating forward, those results suggest that only half of all atolls in 
existence today have a chance of keeping pace with rising seas under the 
best of conditions, he says. “Given that this was in a time with no human 
impact, we feel this is the most optimistic scenario possible.” 


NOT SO FAST 

Kiribati and other low-lying island nations have long been held up as 
the countries most susceptible to the ravages of rising seas. In 2001, the 
Intergovernmental Panel on Climate Change (IPCC) highlighted! predic- 
tions that two-thirds of Kiribati and the nearby Marshall Islands would be 
inundated bya sea rise of 80 centimetres. 

But the idea that these atolls will disappear 
any time soon has been challenged by Paul 
Kench, a coastal geomorphologist at the Uni- 
versity of Auckland. He and his colleagues have 
pored over satellite images, comparing new 
and old aerial photographs to see how such 
islands have changed. 

Ina 2010 study, he and Webb determined’ 
that 23 out of 27 atoll islands scattered across 
Kiribati, Tuvalu and the Federated States of 
Micronesia had either increased in area or 
remained stable in recent decades. The results, 
they reported, “contradict widespread percep- 
tions that all reef islands are eroding in response to recent sea level rise”. 
The researchers concluded that these islands are more “resilient land- 
forms” than previously thought. The study created a media stir in the 
region and beyond. It has been widely cited by climate-change sceptics 
seeking to punch holes in research on global warming and its impacts. 

Kench recognizes the powerful forces of climate change but complains 
that too many scientists and activists focus solely on rising sea levels while 
ignoring the other part of the equation: how the land responds. “There 
are a lot of claims that islands are passive geological entities that will sit 
there and drown,’ he says. “Our work shows that they are anything but 
static. They are dynamic. They move around and they can grow. So just 
because sea level is rising, it doesn’t mean doom and gloom for all atolls.” 

He also believes that most scientists make a mistake by tethering the 
fate of atoll islands to the health of surrounding coral reefs. Even if reefs 
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A man rebuilds the sea wall that protects his home on South Tarawa. 


die, he says, they can provide sufficient sediment to maintain islands 
for a century or more. 

But Hubbard considers Kench’s views shortsighted. “If you run out of 
reefs, you run out of sediment, and once you run out of sediment, you run 
out of islands; he says. “A lot of this is a semantics issue, challenging when 
the reef island is going to be physically underwater. Those reef islands are 
going to be abandoned long before that because they are uninhabitable” 

On Tarawa and other Kiribati islands (see ‘Isolated islands’), most 
people do not dwell on such matters, going about their daily lives just 
like residents of other countries. But their president has earned inter- 
national recognition for speaking out on the threats of climate change. 

In an interview, Tong dismisses those who 
suggest that atolls are resilient to rising seas, 
saying that they have the luxury of “talking 
from the top of a mountain” and not putting 
their lives on the line. “These people are not liv- 
ing here. Their grandchildren will not be living 
here. If they believe that, let them come here,” 
he says, pounding his fist on a chair armrest 
for emphasis. “Id rather plan for the worst and 
hope for the best.’ 

Tong has told his people that they must 
prepare to leave, seeding the idea of an early 
“migration with dignity’, rather than fleeing 
as refugees when storm-generated waves wash 
over the islands. Last year, his government completed an US$8-million 
purchase of 22 square kilometres of hilly land in Fiji, to grow food and pro- 
vide possible refuge for some of his people — although it will not accom- 
modate all of them. He does not know when people will need to migrate, 
but he wants to purchase more land in Australia and New Zealand, saying 
that it is much cheaper than trying to build sea walls and other defences. 
“Tf we build up these lands, it’s going to cost billions of dollars,” he says. 
“We might as well be buying land for millions of dollars elsewhere.” 


DEVELOPING CHALLENGES 

For nearly a week after Teaoraereke flooded, resident Matua Kamori 
worked alongside his neighbours to build a makeshift sea wall where 
the high tide had breached a sand berm on the beach. Villagers piled up 
chunks of coral scavenged from the shore and grouted them together 
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Children play in a king tide that flooded their family’s land on Tarawa. 


with cement donated by a local church. 

Kamori, 33, lives in the village with his wife and 4 kids on a small parcel 
of land given to him by his wife's uncle in exchange for looking after one 
of his sons. To prepare the land, Kamori spent months scouring sand and 
coral gravel off the beach and hauling it to the site with nothing more than 
arice sack. Over time, he fashioned a building pad halfa metre high and 
constructed a hand-hewn traditional house of wood and thatch on it. Such 
beach mining is rampant on Tarawa, according to household surveys. 
Government studies’ show that it increases the likelihood of flooding by 
lowering the protective sand berm that keeps the highest tides at bay. 

In the case of the recent inundation, Kamori says that he fared better 
than most: the water reached calf-deep in his house, rather than thigh- 
high. But nothing could be done to stop the briny stew of salt water, 
mixed with human and animal waste, from polluting his well or killing 
his garden of vegetables and banana and papaya trees. 

Kamori says that he settled on the land because he had nowhere else to 
go. As crowding increases, new settlements are pushing into vulnerable 
lowlands, places they historically would have avoided. 

The individual actions of settlers such as Kamori are only part of the 
problem on Tarawa. Large-scale construction projects over the years 
have also exacerbated flooding and erosion, says Naomi Biribo, Kiri- 
bati’s secretary of fisheries and marine resources development. Biribo 
earned a PhD in Australia by examining the impact of the sea walls 
and other human structures on Tarawa. The construction of causeways, 
rather than bridges, to connect the islets had the effect of closing chan- 
nels and disrupting the flow of sediment that normally resupply some 
eroding coastlines, she found’. Reclamation projects that create new 
land are another problem: although such efforts have added hundreds 
of hectares to Tarawa, they accelerate erosion elsewhere, says Biribo. 

For Tarawa residents, she says, the thin ribbons of land leave little 
room to move. “In many places on Tarawa, you can stand in the middle 
and you can see the ocean on one side and still see lagoon on the other 
side,’ she says. “If we retreat from the ocean side, and institute a setback, 
we will fall into the lagoon” 

Biribo’s work suggests that sea-level rise may be having a small influ- 
ence on the shoreline changes happening today, but nothing compared 
to human activities and the seasonal variations in erosive tides and 
waves in the Pacific that come with the El Nifio periods of warming 
and La Nifa cooling. 


ISOLATED ISLANDS 
Tarawa is the capital 

of the Republic of 
Kiribati, a collection 

of 33 islands. 
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Donner agrees that climate change has been dwarfed by other fac- 
tors so far. “You cannot blame the flooding on sea-level rise; he says. 
“At least not yet.” 

Where does this leave residents of Kiribati? Webb has long wrestled 
with that question. He is married to a woman from Tarawa, and they 
own a house there, where they live with their children for part of the year. 

Webb was also a lead author of the small-islands chapter” of the 
IPCC’s fifth assessment report in 2014, which found that rising seas pre- 
sent “severe sea flood and erosion risks for low-lying coastal areas and 
atoll islands” It highlighted one projection® that a 50-centimetre rise in 
sea level could displace 1.2 million people from low-lying islands in the 
Caribbean Sea and the Indian and Pacific oceans; that number almost 
doubles if the sea level rises by 2 metres. And yet, the latest assessment 
steered clear of the IPCC’s previous assertion that an 80-centimetre rise 
would inundate two-thirds of Kiribati. 

Scientific understanding of atoll geology has sharpened since that 
earlier projection. Webb expects some remnants of Tarawa to remain 
a century or two from now, but probably no more than some wave- 
washed gravel banks — and by that point, everyone will have long gone. 

The geological evidence does not get to the key human question about 
the destiny of these Pacific islanders. That leaves Webb facing a difficult 
question — one he hears from his own Kiribati-born teenagers. “How 
long do we have?” they ask. To that, he replies: “Your children will not 
grow old in the atolls” = 


Kenneth R. Weiss is a freelance writer in Carpinteria, California. 
A grant from the Pulitzer Center on Crisis Reporting contributed to 
research for this article. 
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HOW TO MAKE THE MOST OF 


CARBON DIOXIDE 


Researchers hope 
to show that using 
the gas as araw 
material could 
make an impact on 
climate change. 


BY XIAOZHI LIM 


n 29 September, the XPRIZE Founda- 

tion based in Culver City, California, 

announced a 4%-year competition 

that will award US$20 million to the 

research team that can come up with 
the best way to turn carbon dioxide from a 
liability into an asset. 

With gigatonnes of the gas pouring into the 
atmosphere each year, and with the conse- 
quences for global climate becoming increas- 
ingly obvious, the Carbon XPRIZE would 
reward technologies that can convert CO, emis- 
sions from coal and natural-gas power plants 
into useful products such as alternative building 
materials, fuels and raw material for the manu- 
facture of plastics and other chemicals. 

The invitation should have plenty of takers: 
a growing number of companies and research 
chemists are already pursuing that goal. In 
Reykjavik, what looks like an overgrown 
playground climbing frame is actually a small 
chemical plant that turns CO, into methanol: a 
fuel that can also be used to manufacture prod- 
ucts ranging from paints to wrinkle-resistant 
textiles. In Houston, Texas, another small 
plant is turning CO, into materials that go 
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on to become coatings and adhesives. And in 
Tokyo, Japan, Asahi Kasei Chemicals is widely 
licensing its technique for turning CO, into 
the polycarbonate plastics used in bulletproof 
glass, spectacle lenses and electronic parts. 

Using this greenhouse gas as a raw material 
is an idea that many scientists once dismissed 
as hopeless, says Chunshan Song, a chemical 
engineer at Pennsylvania State University, Uni- 
versity Park. As a practical matter, he says, “lots 
of people believed that nothing could be done 
with CO, utilization” after the stuff went up 
the smokestack. 

As a source of carbon, sceptics argued, the 
gas was far more difficult and expensive to 
obtain than the petroleum, coal and natural 
gas that now provide the raw material for most 
chemical manufacturing. And even if CO, 
could be captured cheaply enough, converting 
such a stable molecule into more-useful chem- 
icals would generally require lots of energy, 
which might well come from fossil-fuel plants. 
The conversion could cost a fortune and make 
more CO, than it consumed. 

But the balance is starting to shift. New 
conversion technologies are allowing 
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Pellets of urea fertilizer 
are made from carbon 
dioxide in a plantin 
Daharki, Pakistan. 


the energy-hungry 
chemical reactions 
to proceed more effi- 
ciently. Renewable 
sources such as solar 
and wind can increasingly supply that energy 
at a competitive cost without the carbon pen- 
alty. And solutions to the capture problem 
may be forthcoming if governments follow the 
recommendations of the Intergovernmental 
Panel on Climate Change (IPCC) and man- 
date carbon capture and storage — grabbing 
the CO, coming out of power plants and other 
industries, and locking it away underground. 
But rather than simply burying it, companies 
could defray the cost of capture by putting 
the gas to productive use (see Nature 526, 
306-307; 2015). 

Michele Aresta, a chemist at the Univer- 
sity of Bari in Italy, estimates that if currently 
known processes were deployed most effi- 
ciently and at the greatest possible scale, they 
could directly use some 300 million tonnes 
of CO, per year, while indirectly reducing 
emissions by around a gigatonne per year — 
roughly 5% of the total net emissions’. This is 
hardly a total solution to the CO, problem, he 
says, but it is substantial nonetheless. And that 
is not counting the additional benefits, which 
include a shift away from relying on fossil fuels 
as a source of carbon for materials and chemi- 
cals, as well as the use of CO,-derived fuels to 
store and transport energy from wind, solar 
and other intermittent sources. 

The Carbon XPRIZE may well accelerate 
these developments. But research into CO, 
conversion has already been receiving support 
from funding agencies worldwide. Since 2009, 
for example, the German Ministry of Education 
and Research has pumped in around €100 mil- 
lion (US$110 million) to support research on 
the use of CO, in chemical and synthetic-fuel 
production. In 2011, the US Department of 
Energy invested $106 million in various CO,- 
utilization projects. The European Union has 
lined up a 2020 Horizon Prize worth €1.5 mil- 
lion for a technology that demonstrates viable 
CO, reuse. And over the next five years, China 
is expected to invest some 30 billion yuan 
($4.7 billion) in CO, recycling in the coal, steel, 
cement and paper industries. 


LOW-HANGING FRUIT 
Industrial uses of CO, have been around for 
generations. Every year, about 20 million tonnes 
of the gas are used ‘as is’: for example, it’s the ‘fizz’ 
in fizzy drinks, and it can also be used as a high- 
pressure solvent to clean electronics. But gas 
used in this way quickly returns to the atmos- 
phere, where it contributes to global warming. 
Another 114 million tonnes of CO, go into 
the production of urea fertilizer every year, 
accounting for more than 60% of the total 
worldwide consumption of the gas. But urea 
is usually made by reacting CO, with ammo- 
nia — most of which is made with hydrogen 


derived from fossil fuels. Researchers such as 
Peter Styring, a chemical engineer at the Univer- 
sity of Sheffield, UK, are working with industrial 
partners to develop catalysts and reaction con- 
ditions that will allow the production of urea 
directly from nitrogen, water and CO,. But 
currently, the urea industry is still a net source 
of CO,. 

In the quest to make more materials from 
CO, without adding to the greenhouse prob- 
lem, chemists’ first challenge is to overcome 
the molecule’ stability. Unlike a carbon atom 
in isolation, which has four electrons that are 
available to form bonds, the carbon in CO, has 
given up all four to the tight grip of the oxygen 
atoms. That grip has to be loosened before 
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calcium carbonate manufactured — versus 
zero kilograms for every tonne that is mined. 
In Asia, meanwhile, manufacturers licens- 
ing technology developed by Asahi Kasei 
Chemicals are using CO, to produce about 
660,000 tonnes of polycarbonates annually, 
roughly 14% of the global total. Polycarbon- 
ates are plastics used in products ranging 
from water bottles to spectacle lenses. They 
are usually made in a reaction involving 
phosgene (COCL,): a toxic compound that is 
infamous for its use as a poison gas in the First 
World War. It is typically produced by com- 
bining chlorine with carbon monoxide made 
by burning coal. The Asahi Kasei process 
replaces phosgene with CO, — overcoming the 


CHEMISTS FIRST CHALLENGE IS 10 
OVERCOME THE MOLECULE 'S STABILITY. 


the carbon can form new bonds — and that 
takes energy (see “The Carbon Challenge’). “If 
it [energy] comes from fossil fuels, it’s point- 
less,” says Styring. “Everything has to be from 
renewable energy.’ 

This energy cost of separating the carbon 
and oxygen atoms is so exorbitant that some 
researchers in the field simply sidestep the 
problem. They focus on products such as cal- 
cium carbonate (CaCO,), which can be made 
by combining CO, with calcium compounds 
in low-energy reactions that leave the car- 
bon-oxygen bonds relatively undisturbed. 
Calcium carbonate also happens to have a large 
potential market: it is used as a construction 
material and for whitening factory-produced 
paper. And because it is more inert than CO,, 
it could be used to store the gas over long time 
scales — something that already happens 
naturally in limestone deposits and coral reefs. 
“Nature has used mineralization to store mil- 
lions of billions of tonnes of CO,,.’ says Martin 
Devenney, chief operating officer at Calera, a 
green-energy company in Los Gatos, California. 

Most of the calcium carbonate now used in 
industry, an estimated 15 billion tonnes per 
year, comes from mining such deposits. But 
since 2013, Calera has been operating a dem- 
onstration facility that produces the material 
from CO, and carbide residue: an industrial 
waste that comes from the production of poly- 
vinyl chloride (PVC). Calera first adds water to 
the residue to extract calcium hydroxide, then 
bubbles CO,-rich flue gas from a nearby indus- 
trial plant through the solution to obtain pure 
calcium carbonate, which is turned into fibre 
cement boards that can be used in construction. 

After factoring in energy costs, says Deven- 
ney, the company’s process captures about 
170 kilograms of CO, for every tonne of 


molecule’s stability by putting it through three 
intermediate transformations. 

Asahi Kasei’s process is still unable to make 
polycarbonate chains that are long enough 
for applications such as water bottles. But it is 
cost-competitive with the standard method, 
and according to Aresta’s estimates, reduces 
indirect CO, emissions from 6 tonnes for every 
tonne of polycarbonate produced to 1 tonne. 

Yet another promising area for low-energy 
CO, utilization is in the production of poly- 
ols: a group of sugar-like molecules that are 
often used as the raw material for polymers 
such as polyurethanes, which are then used in 
mattresses, adhesives, coatings, refrigerator 
insulation and Spandex. Polyols are mostly 
made from precursor molecules known as 
epoxides. But Novomer, a green-chemistry 
company in Waltham, Massachusetts, has 
developed a cobalt-based catalyst that allows 
half of the epoxides to be replaced with CO). 
“Our catalyst will not work without the carbon 
dioxide,’ says Peter Shepard, executive vice- 
president of Novomer. The Novomer process 
is currently being tested at a plant in Houston, 
he says, where it is producing 3,000-4,000 
tonnes of polyols per year while generating 
about one-third of the CO, that would be pro- 
duced through conventional methods. And the 
company has begun designing a facility with a 
capacity of 100,000 tonnes per year, scheduled 
for completion in 2017. 


AN UPHILL TASK 

Despite these successes, the demand for low- 
energy products such as polycarbonates and 
inorganic carbonates is relatively limited. The 
big money is in hydrocarbons, which consist of 
one or more carbon atoms, often arranged in 
long chains or complex rings, surrounded by 
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Turning carbon dioxide into more-useful chemicals 
requires an input of energy to break its strong bonds. 
Each one-carbon molecule in this sequence has a 
higher chemical energy than the one before. 
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hydrogen atoms. As well as their many other 
uses, hydrocarbons are ubiquitous in petrol, 
diesel, jet fuel and just about every other liquid 
energy source on Earth — a market roughly 14 
times larger than that for non-fuel chemicals. 
Making hydrocarbons out of CO, is a “com- 
pletely different story” from making low- 
energy products, says Aresta. In addition to the 
energy required to break the carbon atom free 
from oxygen so that it can link up with other 
carbons, the process needs a source of cheap 
hydrogen — which currently comes from fossil 
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As with other efforts to make fuels from 
CO,, however, the challenge is to do it effi- 
ciently: even in the laboratory, the known 
methods consume far more energy than the 
resulting fuels can provide. Among the most 
vexing problems is how to obtain the hydro- 
gen. Right now, the choices are to extract the 
gas from fossil fuels, a source that is cheap 
but hardly green, or to make it by splitting 
water, a process that is both energy intensive 
and costly. 

“We need at least 15 years of good research” 


MAKING HYDROCARBONS OUT OF CO, IS 
A COMPLETELY DIFFERENT STORY. 


fuels. And it is not easy to control the reactions 
to get a target hydrocarbon molecule instead of 
a related molecule that is one carbon shorter or 
longer. “This is tricky,’ says Song. 

Still, researchers are making progress. 
Since 2010, for example, Song has developed 
a series of catalysts based on iron and cobalt 
that can bring CO, molecules close together 
with hydrogen molecules at 300°C and about 
10 atmospheres, forming ethylene, propylene 
and butane’. Because his experiments are still 
being done in the laboratory, Song cannot yet 
say how much his process could reduce green- 
house-gas emissions when carried out on an 
industrial scale. But any headway could have 
a large effect: these three chemicals are among 
the most heavily used in the chemical indus- 
try, says Song, with demand measured in the 
hundreds of millions of tonnes per year. 

Song is also developing palladium- 
copper catalysts that can efficiently turn CO, 
and hydrogen into methanol. “Mother Nature 
makes biomass and it takes months and years,” 
he says. “We can do it in seconds.” 


to do better, estimates Aresta. And even then, 
he says, the conversion will probably not have 
a net benefit for climate unless the manufac- 
turing plants can make extensive use of cheap 
energy from solar, wind and other renewa- 
bles. “If we don’t use these kinds of energy, we 
will never be able to go to large-scale deploy- 
ment,” he says. 


FUELS AND FUTURE 

Some companies are already trying to do just 
that. In Reykjavik, for example, Carbon Recy- 
cling International (CRI) makes methanol 
from CO, and hydrogen using the emissions 
and electricity from a neighbouring geother- 
mal power plant. The CO, in this case comes 
not from fossil fuel but from carbonate rocks 
baking in Earth’s heat deep underground. 
The hydrogen comes from the electrolysis of 
water — a source that would be prohibitively 
expensive if the power plant's electricity were 
not so abundant. CRI currently produces about 
5 million litres of methanol per year, or about 
1.5% of the global production of methanol, 
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which has 95% less CO, emissions than pet- 
rol. The advantage for Iceland is that metha- 
nol, being a liquid fuel, is a much easier way 
to export the country’s wealth of geothermal 
energy than, say, laying an underground elec- 
tricity cable to Europe. But for CRI, the plant 
is a test bed for wider deployment of its CO,- 
to-fuels technology, which it plans to market 
in other countries such as Germany as a way to 
capture and store large amounts of renewable 
energy at times of low demand. “It's a matter of 
accessing energy at the right time,” says K.-C. 
Tran, cofounder and chief executive of CRI. 

In most places, because renewable energy 
isn’t widely and cheaply available, the fuels 
from CO, are still not competitive with those 
currently being made from crude oil, says Jim 
Yang Lee, a chemical engineer at the National 
University of Singapore. “It solves certain, 
localized problems, assuming you have cer- 
tain resources,” he says, “but it’s not solving 
the global CO, problem” 

The response from proponents is that it does 
not have to solve the problem all on its own: 
if some researchers manage to make progress 
with CO,-to-fuels conversion, they argue, 
while CO,-based chemical manufacturing 
continues to expand, CO, utilization could 
make a noticeable dent in greenhouse-gas 
emissions in the near future. 

This year, Styring evaluated various scenar- 
ios of CO, utilization®. If 100% of urea, 30% 
of minerals, 20% of specific chemicals and 
polymers, 10% of methane and 5% of diesel 
and aviation fuels were supplied by currently 
known CO,-utilization methods, he estimates 
that around 1.34 gigatonnes of CO, would be 
consumed per year. This equals around 83% 
of the IPPC’s 2030 global target for emissions 
reductions through CO, capture and storage. 
“These are very conservative estimates,” says 
Styring. “It is likely that the impact will be 
much greater.’ 

For this to happen, researchers acknowledge 
that it will take far more than clever chemis- 
try. Governments and industries worldwide 
might have to consider CO, utilization along 
with storage, or even implement a carbon tax 
to help synthetic fuels stay competitive. “If you 
have to pay for CO, to go into the atmosphere, 
then the story will change,” says Song. 

Aresta is optimistic. “If we are able, in 20 
years from now, to use solar and wind energy 
in a concrete way,’ he says, “and use it to bring 
back CO, into fuels, we can very, very easily 
reduce the emissions of CO, by greater than 
10%.” m 


XiaoZhi Lim is a freelance writer in 
Singapore. 
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Even extreme environments such as Antarctic ice lakes host microbes. 


Create a global 
microbiome effort 


Understanding how microbes affect health and the 
biosphere requires an international initiative, argue 
Nicole Dubilier, Margaret McFall-Ngai and Liping Zhao. 


icrobes have been discovered on 
Earth wherever anyone has looked 
for them, from the boiling waters 


of Yellowstone's hot springs in Wyoming 
to the depths of cold, dark Antarctic lakes 


under 800 metres of ice. A holistic under- 
standing of the role of Earth’s microbial com- 
munity and its genome — its microbiome 
— in the biosphere and in human health is 
key to meeting many of the challenges that 


face humanity in the twenty-first century, 
from energy to infection to agriculture. 

Recognizing this, a group of leading 
US scientists this week proposes' the creation 
of a Unified Microbiome Initiative (UMI). 
The UMI would bring together researchers 
and representatives from public and private 
agencies and foundations to study the activi- 
ties of Earth’s microbial ecosystems. 

The UML is conceived as a US initiative; 
springing from meetings sponsored by the 
White House Office of Science and Tech- 
nology Policy and the Kavli Foundation of 
Oxnard, California. But Earth’s biome is not 
defined by national borders, and efforts to 
unlock its secrets should go global. 

We believe that to be successful, micro- 
biome research will require a coordinated 
effort across the international community 
of biologists, chemists, geologists, math- 
ematicians, physicists, computer scientists 
and clinical experts. As three scientists 
working in three countries — Germany, 
China and the United States — we call for 
an International Microbiome Initiative 
(IMI) supported by funding agencies and 
foundations around the world, in addition 
to the UMI. This would ensure the sharing 
of standards across borders and disciplines, 
and bring cohesion to the multitude of 
microbiome initiatives that exist. 


MICROBIAL REVOLUTION 

Science is only just realizing the full impor- 
tance of the microbial world. This is thanks to 
developments such as low-cost high-through- 
put sequencing; advances in sample prepa- 
ration that allow researchers to sequence 
genomes from individual cells as well as from 
microbial communities; improvements in 
computing power and imaging technologies; 
and the development of bioinformatics tools 
to help make sense of the data. 

Thus biologists are gaining insight into the 
identity and function of microbes that can- 
not be grown in the laboratory — the vast 
majority of Earth’s microbiome. Currently 
only 35 bacterial and archaeal phyla are rec- 
ognized on the basis of classical approaches 
to microbial taxonomy. Sequencing efforts in 
the past few years have pushed the number 
closer to 1,000 (ref. 2). 

Newfound groups of bacteria are throw- 
ing old assumptions about the tree of life into 
question, and revealing vast holes in our 
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> understanding of the planet's biosphere 
and its evolution. The discovery in 2003 of 
giant viruses with hundreds or even thou- 
sands of genes shattered the existing defini- 
tion of living organisms’. (Viruses had long 
been considered to straddle the line between 
living and non-living things because of their 
extreme reliance on host genes.) 

It is also becoming clear that microbes 
provide ecosystem services that are crucial 
to local and global sustainability. The micro- 
biota in and on crops, trees and other plants, 
and in the soils in which these grow, provide 
nitrogen, phosphorus and other essential 
nutrients. They break down pollutants and 
suppress the activity of pathogenic microbes. 
Recognizing the untapped power of soil and 
plant microbiomes in enhancing agricultural 
productivity, companies such as Monsanto 
are investing millions of dollars in research 
and development in this area. 

Microbes in the oceans produce 50% of the 
oxygen we breathe, and — through photosyn- 
thesis — remove roughly the same proportion 
of carbon dioxide from the atmosphere. They 
also remove up to 90% of methane from the 
world’s oceans. Over the past decade, research 
cruises such as Tara Oceans and the Global 
Ocean Sampling Expedition have sampled, 
sequenced and analysed the ocean's micro- 
organisms. These have provided insight into 
the roles that marine bacteria, archaea, viruses 
and eukaryotic microbes have as global pri- 
mary producers that provide nutrition at the 
base of the food chain; remineralization (the 
transformation of organic molecules into 
inorganic forms); and the deposition of car- 
bon on the sea floor. 

Some of the most profound insights in the 


crucial role of microbes for human well-being 
have emerged from analyses of the microbes 
on and in our bodies — their genomes, tran- 
scriptomes, proteomes and metabolomes. 
(These are analyses of genes, RNA molecules, 
proteins and chemical metabolites). Com- 
plex gut communities, for instance, protect 
us from disease, provide nutrition, and affect 
our development even before birth’. 


STUMBLING BLOCKS 

There are two major stumbling blocks to 
advancing our understanding of microbes’ 
role in the biosphere. First is the fragmenta- 
tion of the life-sciences field. Second is a lack 
of coordination among the various micro- 
biome research endeavours under way. 

Disciplinary silos are problematic because 
any attempt to understand anything about 
plants or animals needs to be rooted in micro- 
biology. For example, for decades, circadian 
rhythms in the mammalian gastrointestinal 
system were studied in the context of human 
physiology and gene expression. Yet in the 
past two to three years, biologists have discov- 
ered that daily cycles in the motility of the gut 
— the production of digestive enzymes, gene 
expression in gut cells and so on — rely on 
the activities of gut microbiota. Compounds 
produced by microbes either cause changes in 
the gut directly or pass into the host’s blood- 
stream and influence the central nervous sys- 
tem, possibly through neural, hormonal and 
immune pathways’. 

The first hint of this came from the 
discovery (led by one of us, M.M.-N.) that 
bioluminescence and other products of 
the marine luminous bacterium Vibrio fis- 
cheri regulate the expression of a circadian 


GOING GLOBAL 


Four functions of an International Microbiome Initiative 


Guidelines. Create a working group 

to oversee the development and 
implementation of guidelines for the study 
of microbiomes, drawing on and developing 
those already established by other 
initiatives such as the Earth Microbiome 
Project. The group would set standards 

for methods, data analysis, data sharing 
and intellectual-property rights, and 

would partner with funding agencies and 
publishers to ensure that researchers follow 
the agreed guidelines. 


Priorities. Develop a common research 
agenda, with the goal of enabling 
comparative analyses that range from local 
to global scales. For instance, one priority 
could be to increase the number and 
diversity of people sampled in studies of the 
human microbiome. 


Tools. Identify new cross-disciplinary 
methods for microbiome studies. Examples 
include imaging techniques ranging from 
confocal advances to cryotomography — 
which can resolve subcellular structure and 
reveal microbial cell function — and ways 
to monitor the production and exchange of 
microbial metabolites. 


Forums. Establish platforms for the 
discussion and exchange of research 
within and between nations, the 
development of programmes for training 
the next generation of microbiome 
scientists, and the establishment of 
outreach projects to educate and engage 
the general public. A good model is the 
Ocean Sampling Day’s citizen’s science 
campaign, which recruits citizens to help 
obtain environmental data. 
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cryptochrome gene in squid’. This high- 
lights the value of investigating host- 
microbe relationships from all branches of 
the tree of life, including those in which only 
a single symbiotic species is involved. 


PROJECT PROBLEMS 

Since a 2005 workshop in Paris, at least eight 
programmes have been established to study 
the human microbiome. These include the US 
Human Microbiome Project, the Canadian 
Microbiome Initiative, MetaHIT (involv- 
ing the European Union and China) and the 
Human Metagenome Consortium in Japan. 

These initiatives have generated vast 
amounts of data that are not easily compa- 
rable. For example, many studies on human 
microbiota identify species (or operational 
taxonomic units) and map evolutionary rela- 
tionships using the 16S ribosomal RNA gene. 
Differences between the primers (which pro- 
vide a starting point for the DNA synthesis) 
used to amplify this gene can have a big effect 
on the sequence data ultimately obtained. 
Andestimates of species numbers can vary by 
two to three orders of magnitude, even when 
applied to the same sample, because different 
software packages are being used to analyse 
the amplified genes’. 

This lack of consistency in approaches 
means that effective comparisons and inter- 
pretations of human microbiota studies are 
often not possible. The International Human 
Microbiome Consortium, established in 
2008, and the International Human Micro- 
biome Standards project, launched in 2011, 
have attempted to address some of these 
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Microbes such as Planococcus halocryophilus (2), cyanobacteria (4) and 


Arthrobacter crystallopoietes (5) are found everywhere from soils (6) to the 
Arctic’s Chukchi Sea (1) to Yellowstone’s Grand Prismatic Spring (3). 


issues. But, especially in the case of the stand- 
ards project, many national research projects 
were already well under way. Both initiatives 
are struggling with issues about data sharing 
and differences between national policies in 
ownership and property rights. 


GLOBAL SOLUTION 

We think that an IMI could do a better job. 
The study of any microbiome demands 
myriad collaborations. These should involve 
basic and applied biologists, including those 
with expertise in microorganisms or in higher 
organisms; informaticians and mathemati- 
cians, who can develop methods that extract 
information from the mountains of sequence 
data; and chemists, physicists and engineers. 
Physical scientists are needed to find new 
ways to measure and manipulate the com- 
pounds that microbes produce and exchange 
with their biotic and abiotic surroundings. 

An IMI would be pivotal in bringing 
together all these experts and allow scien- 
tists to move beyond cataloguing, which has 
characterized much of microbiome work so 
far. And, it would be limiting to recruit such 
a vast range of intellectual, scientific and 
technological expertise from one country 
alone. 

IMI participants could use comparative 
approaches to reveal the factors that under- 
lie the structure and function of microbi- 
omes on local to global scales. An example 
of the potential of comparative approaches 
comes from studies of Native Americans* 
and African hunter-gatherers’. It seems that 
these groups have a much higher diversity 


of microbial partners (a known correlate 
with better health) than do people living in 
industrialized societies. 

By pooling data from scientists from 
around the world, an IMI would gener- 
ate much more knowledge than could one 
country alone. Thanks to the falling costs 
of sequencing machines, individual labs 
will probably soon produce more data than 
the conventional large sequencing centres, 
such as the Joint Genome Institute in Wal- 
nut Creek, California. Yet for any one labo- 
ratory, sample sizes might be restricted, and 
researchers might have only limited bioinfor- 
matics capacity. 

An IMI could 
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tant for countries 
that may not have 
the funds to invest in their own global-scale 
projects. For example, cloud-computing 
platforms would allow people to upload 
and analyse sequencing data as soon as they 
are available. The IMI could also control and 
organize access to metadata (the associated 
host disease phenotype data, for a human 
gut microbiota sample, for instance) with- 
out which meaningful interpretation of 
the data is not possible. This could also be 
a way to safeguard the intellectual property 
of researchers, funding bodies and nations. 
Most importantly, an IMI is essential 
when it comes to solving problems that affect 
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the biosphere. Although processes involving 
microbiomes vary from place to place, the 
impact of such processes can often be felt 
globally. Potent greenhouse gases, such as 
nitrogen oxides produced by denitrifying 
bacteria in overfertilized Chinese farming 
lands or methane released by archaea in the 
millions of ruminant animals in Australia 
and New Zealand, may have contributed 
substantially to global warming. Billions 
of tonnes of human-made toxic chemi- 
cals have overwhelmed the degrading and 
recycling capacity of microbiomes. And the 
imprudent use of antibiotics has contributed 
worldwide to epidemics of chronic diseases, 
such as obesity, diabetes and cancer. The 
solutions to some of these problems may 
come out of local research, but an IMI is 
essential for ensuring that comparable data 
are produced from efforts throughout the 
international scientific community. 

We do not have all the answers when it 
comes to designing an IMI. In fact, we think 
that the first step to launching such a project 
should be the bringing together of leading 
microbiome researchers from across the 
globe to discuss its goals. However, even 
at this stage, some elements seem crucial 
(see ‘Four functions of an International 
Microbiome Initiative’). 

As long as communities are prepared to 
start afresh and conform to new ground rules, 
we believe that an IMI could succeed where 
other efforts to achieve standardization and 
coordination have struggled. Such a project 
would be ahead of the curve, because the 
tools needed to explore the world’s microbes 
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have only recently become available. Also, 
if multiple working groups with repre- 
sentatives from across the life sciences 
were established — similar to those set up 
by an effort to assess marine organisms 
called the Census of Marine Life — an IMI 
could be much broader in scope than any 
pre-existing programme. 

Finally, an IMI would be able to solve 
the data sharing and intellectual-property 
issues that have been stumbling blocks for 
previous efforts, by organizing and con- 
trolling access to the metadata that are so 
essential for interpreting results, publish- 
ing papers and filing patents. 

It is crucial that the IMI is launched 
quickly to avoid corrective actions hav- 
ing to be applied after-the-fact to national 
efforts. We invite private foundations 
that have been pivotal in mounting 
international research efforts to support 
an IMI. These include the Gordon and 
Betty Moore Foundation with its Marine 
Microbiology Initiative, the Alfred P. Sloan 
Foundation with its Census of Marine Life, 
the Kavli Foundation with its Brain Initia- 
tive, and the Bill & Melinda Gates Founda- 
tion with its Global Health Program. 

We also encourage national funding 
agencies to open up their programmes to 
international collaborations and to adopt 
any standards established by an IMI. 
Finally, we urge scientists to help make 
an IMI happen by sharing their data. 

So much can be gained by creating an 
IMI. Further uncoordinated national 
microbiome programmes will almost 
certainly waste research efforts and tax- 
payers’ money. Let’s transcend national 
silos and gain universal insights that will 
benefit all humankind. = 


Nicole Dubilier is a director at the Max 
Planck Institute for Marine Microbiology 
in Bremen, Germany. Margaret 
McFall-Ngai is professor and director of 
the Pacific Biosciences Research Center 
at the University of Hawaii at Manoa, 
Honolulu, Hawaii, USA. Liping Zhao 

is professor of microbiology at Shanghai 
Jiao Tong University, Shanghai, China. 
e-mail: ndubilie@mpi-bremen.de 
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The world’s oldest continually operational university was founded in Fes, Morocco, in AD 859. 


Revive 
universities of the 
Muslim world 


To boost science, higher-education institutes must give 
students a broad education and become meritocratic, 
say Nidhal Guessoum and Athar Osama. 


he Islamic civilization lays claim to the 

| world’s oldest continually operational 
university. The University of Qarawi- 

yyin was founded in Fes, Morocco, in aD 859, 
at the beginning of an Islamic Golden Age. 
Despite such auspicious beginnings, univer- 
sities in the region are now in dire straits, as 
demonstrated by a report we have authored, 


released this week (see go.nature.com/korli3). 

The 57 countries of the Muslim world — 
those with a Muslim-majority population, 
and part of the Organisation of Islamic Coop- 
eration (OIC) — are home to nearly 25% of 
the world’s people. But as of 2012, they had 
contributed only 1.6% of the world’s patents, 
6% of its academic publications, and 2.4% of 
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QUARTER DECK 


Investing on average 0.5% of their gross domestic product (GDP) on research and development 
(less than one-third of the global average), the Organisation of Islamic Cooperation (OIC) countries’ 
science output lags, as measured by patents and publications. 


8 OIC countries Rest of the world 


25% 


Population 


R 75% 


the global research expenditure’” (see ‘Quar- 
ter deck’). There have been only three Nobel 
laureates in the sciences from OIC countries; 
today these nations host fewer than a dozen 
universities in the top 400 of the many world 
rankings, and none in the top 100. 

To assess this situation, for the past two 
years we have led an international non-gov- 
ernmental and non-partisan task force of 
experts, formed by the Muslim World Science 
Initiative. The task force was chaired by Zakri 
Abdul Hamid, science adviser to the prime 
minister of Malaysia. It included a dozen 
experts and scholars — including policymak- 
ers, vice-chancellors, professors, and science 
communicators — from around the world. 

Our work confirmed many widely known 
problems, as highlighted by reports such 
as the Royal Society’s 2014 Atlas of Islamic 
World Science and Innovation”. For exam- 
ple, OIC countries on average invest less 
than 0.5% of their gross domestic product 
(GDP) on research and development (R&D). 
Only Malaysia spends slightly more than 1% 
(the world average is 1.78%; most advanced 
countries spend 2-3%). Students in the Mus- 
lim world who participate in standardized 
international science tests lag well behind 
their peers worldwide, and the situation 
seems to be worsening”. 

Our report highlights an even more 
problematic situation. University science 
programmes are using narrow content and 
outdated teaching methods. In most OIC 
countries, students are channelled into sci- 
ence or non-science streams around the 
age of 14, and their education thereafter is 
completely binary: science and technology 
students receive little in the way of humani- 
ties, social-science, language or arts educa- 
tion, and vice versa. Only one university in 
the region offers a programme in ‘science and 
technology studies’: the University of Malaya 
in Kuala Lumpur. 

To become beacons in society, OIC univer- 
sities need to revitalize their teaching meth- 
ods and meld science with liberal arts such 
as history and philosophy. For universities to 


Research 
expenditure 


2.4% 1.6% 
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< 97.6% 


become truly meritocratic, they must develop 
new ways of assessing faculty members to 
reward valuable research, teaching and out- 
reach. And for this to happen, governments 
must give universities more autonomy. 


QUANTITY AND QUALITY 

Our task force gathered data on science 
production for the 20 OIC countries that 
together have represented more than 90% 
of OIC scientific productivity over the past 
two decades. From the period 1996-2005 to 
2006-15, most countries doubled or tripled 
their production of science papers. Qatar’s 
output rocketed by a factor of 7.7, and Iran’s 
by 7.6. But the number of scientific papers 
produced remains below the average of coun- 
tries with similar GDP per capita. We found 
an average of 4.2 papers per dollar of GDP per 
capita for our OIC sample in the most recent 
decade, compared to an average of 8.6 for 
a group of 4 peer countries such as Brazil, 
Spain, South Korea, South Africa and Israel 
(see Supplementary 


Information; go.nature. “Science 

com/4o3itm). classes 
Papers from these themselves 

OIC countries are havesorida 


cited less frequently 
than those from other 
nations. The average 
was 5.7 citations per paper for 2006-15, 
compared with 9.7 for South Africa and 13.8 
for Israel, countries with a comparable GDP 
per capita. A list of the 100 most-cited papers 
since 1900 has none with a lead author from 
a Muslim-majority nation (see Nature 514, 
550-553; 2014). 

Scientific research must be relevant and 
responsive to society’s intellectual and prac- 
tical needs. This dual goal seems to be out of 
sight — and often out of consideration — for 
most academic institutions in the region. 

For scientists and engineers to be creative, 
innovative and able to engage with ques- 
tions of ethics, religion and the wider social 
purpose of research, students must receive 
a broad, liberal-arts-style education’. A few 


problems.” 
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institutions attempt to relate their students’ 
learning to their cultural backgrounds and 
contemporary knowledge. In the early 1970s, 
Tehran's Sharif University of Technology 
began a rich programme melding Islamic his- 
tory, philosophy and culture with science and 
engineering. Its graduate programme in the 
philosophy of science remains the only one 
in the OIC that we are aware of. It is perhaps 
no coincidence that the most recent Times 
Higher Education world university rankings 
named Sharif University as the top Iranian 
university and number eight in the OIC. 

In recent years, US-style liberal-arts estab- 
lishments have been set up in the region, 
modelled on the long-running and respected 
American University of Beirut and the Amer- 
ican University in Cairo. One such is the 
American University of Sharjah (AUS) in the 
United Arab Emirates (UAE), which this year 
ranked seventh in the QS Rankings of univer- 
sities in the 22 Arab countries. Fully home- 
grown and self-funded and with no formal 
affiliation with a US institution, the AUS 
requires science and engineering students to 
take roughly one-third of their required 40 
or so courses in humanities, social sciences, 
language and communication. 

Habib University, founded last year in 
Karachi, Pakistan, also follows this model. 
Here, science and engineering students must 
take courses such as “Understanding Moder- 
nity’ and “‘Hikma 1 & 2’ — a two-course 
sequence that translates as ‘traditional wis- 
dom’ — as well as many others that seek to 
create rounded rather than narrow engineer- 
ing and science professionals. Other educa- 
tional establishments should follow suit. 


CURRICULAR INNOVATION 

Science classes themselves have serious prob- 
lems. The textbooks used in OIC universities 
are often imported from the United States 
or Europe. Although the content is of a high 
standard, they assume a Western experience 
and use English or French as the language of 
instruction. This disadvantages many stu- 
dents, and creates a disconnect between their 
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Science students at the American University of Sharjah must take humanities and social-science courses. 


education and culture. To encourage the pro- 
duction of higher-quality, local textbooks and 
other academic material, universities need to 
reward staff for producing these at least as 
much as they do for research publication. 

Some basic facts are seen as controversial, 
and marginalized. Evolution, for example, is 
usually taught only to biology students, often 
as “a theory’, and is rarely connected to the 
rest of the body of knowledge. One ongoing 
study has found, for example, that most 
Malaysian physicians and medical students 
reject evolution (see go.nature.com/38cswo). 
Evolution needs to be taught widely and 
shown to be compatible with Islam and its 
culture’, Teaching the philosophy and history 
of science would help, too. 

The global consensus is that enquiry-based 
science education fosters the deepest under- 
standing of scientific concepts and laws. But 
in most OIC universities, lecture-based teach- 
ing still prevails. Exceptions are rare. One is 
the Petroleum Institute, an engineering uni- 
versity in Abu Dhabi, UAE, where the faculty 
has created a hands-on experience with posi- 
tive results on student interest and enrolment, 
particularly of women. 

Another problem is that faculty members 
rarely — if ever — receive any training or 
evaluation in pedagogy. This is true elsewhere 
in the world, but change is harder in many 
OIC nations. In most, curriculum changes, 
faculty appointments and promotions are set 
by ministry rules and decided by centralized 
commissions and bureaucracies. This leaves 
little room for universities to innovate. 


THE WAY FORWARD 

Universities in OIC nations need to be 
granted more autonomy to transform 
themselves into meritocracies that strive 


for scientific excellence and then lead rather 
than follow the winds of change towards 
greater transparency and meritocracy 
within their societies. 

Universities need to promote the right 
metrics, so that they do not inadvertently 
encourage plagiarism and junk science 
through pressure to publish. The region needs 
consistent data on science student and faculty 
profiles, curricula, pedagogy, language of 
instruction and so on, akin to what the Insti- 
tute of Statistics of the United Nations Educa- 
tional, Scientific and Cultural Organization 
collects — but at a fine-grained, university 
level. This is a task that must be undertaken 
by national or transnational bodies, such as 
the Islamic World Academy of Sciences (IAS) 
or the Islamic Educa- 
tional, Scientific and 
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disciplinary, explora- 

tory science education. A good start would 
be training for university teachers, with 
workshops on new tools and approaches. 
Barriers need to be broken between depart- 
ments and colleges and new programmes 
constructed. Professors need to be free to 
teach topics that are not tightly regulated by 
ministries. 

There are grassroots efforts across the 
Muslim world to stimulate curiosity about 
science among students of all ages, operat- 
ing without much government support. 
Ahmed Djebbar, an emeritus science histo- 
rian at the University of Lille in France, has 
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constructed an online, pre-university-level 
course called “The Discoveries in Islamic 
Countries’ available in three languages’, 
which relates science concepts to great 
discoveries and stories from the Islamic 
Golden Age. Such courses should be scaled 
up and shared by many institutions. 

Universities will need to implement 
reforms individually. We hope that the 
inspiration from a few islands of excel- 
lence will, in time, turn the tide of public 
and political opinion. There is precedent. 
In Pakistan, two private universities estab- 
lished in the 1980s — the Aga Khan Uni- 
versity and Hospital in Karachi and Lahore 
University of Management Sciences — revo- 
lutionized medical and business education 
within a decade of their creation. Students 
elsewhere began demanding the standard 
set by these educational pioneers. The same 
can be done for science. 

Our task force is putting out an open call 
for universities across the Muslim world 
to join a voluntary Network of Excellence 
of Universities for Science (NEXUS), to be 
launched early next year. This peer group 
will be managed by the task force and 
housed in science adviser Zakri’s office. We 
plan for NEXUS to run summer schools for 
university administrators, to monitor the 
progress of reforms at participating univer- 
sities, and to issue a peer report card that 
will assess the performance of the universi- 
ties in meeting milestones, thus recognizing 
and inspiring further improvements. True 
transformation will require much broader 
action from ministries, regulators and fund- 
ing agencies, and these may be the most 
resistant to change. 

Without tough reforms, the dream of a 
scientific revival in the Muslim world will 
remain just that. = 


Nidhal Guessoum is professor of physics 
and astronomy at the American University 
of Sharjah, United Arab Emirates. 
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e-mails: nguessoum @aus.edu; 
athar.osama@gmail.com 


1. Royal Society. A New Golden Age? The Prospects 
for Science and Innovation in the Islamic World 
(Royal Society, 2010). 

2. Royal Society. The Atlas of Islamic World Science 
and Innovation (Royal Society, 2014). 

3. Martin, M. O., Mullis, |. V. S., Foy, P. & Stanco, 

G. M. TIMSS 2011 International Results in Science 
(TIMSS & PIRLS International Study Center, 
2012). 

4. OECD. PISA 2012 Results in Focus (OECD, 2014). 

5. Bloom, D. E. & Rozovsky, H. Liberal Educ. 89, 
16-23 (2003). 

6. Guessoum, N. /slam’s Quantum Question: 
Reconciling Muslim Tradition and Modern Science 
(IB Tauris, 2011). 

7. Djebbar, A. The Discoveries in Islamic Countries 
(Editions le Pommier, 2009). 


AMERICAN UNIV. SHARJAH 


ROBERT DOISNEAU/GAMMA-RAPHO VIA GETTY 


The Funeral Procession of the Jaguar: a technician transports a specimen in 1943. 


Sedition in 
the stores 


Laura Spinney extols Robert Doisneau’s haunting images 
of the Paris natural history museum under occupation. 


Doisneau (perhaps best known for 

his image of a couple kissing outside 
the Hotel de Ville) was commissioned to 
record life behind the scenes at the various 
arms of the National Museum of Natural 
History (MNHN) in Paris. Most of the 
images have never been published. They 
are a unique document of the work of a 
research institute in occupied France dur- 
ing the Second World War. Now, a small 


IE 1942, French photographer Robert 


jewel of an exhibition brings them out 
of the stores where they were taken, and 
places them in the limelight where they 
belong. 

Doisneau was a member of the French 
Resistance; he produced fake identity 
papers for his comrades-in-arms. The 
commission to photograph two Paris 
museums, the botanical gardens and 
a zoo that are part of the MNHN, was 
offered to him by the influential publisher 
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Robert Doisneau: Maximilien Vox 


APhotographerat (real name, Samuel 
the Museum Monod), acting 
French National on behalf of the 
Museum of Natural ; 

: : Vichy government. 
History, Paris. , 
Until 18 January 2016. Sympathetic to the 


Germans, this pup- 

pet regime wanted to 
vaunt the vitality of French intellectual life 
under its beneficent new rulers. 

Why did Doisneau agree to such a dubi- 
ous assignment? Recently returned from 
the army, he probably just needed the cash. 
His first baby had only recently been born, 
and a commission from Vox was not some- 
thing that a young photographer turned 
down. Furthermore, lauding France’s 
academic excellence need not have struck 
him as a betrayal. 

What Doisneau found as he toured 
the museums and gardens was a vibrant 
research institute — despite, rather than 
because of, the intrusion of world events. 
Paul Rivet, director of the MNHN’s 
Museum of Man, was in exile in Colombia. 
Others had just returned from military 
service or prisoner-of-war camps — 
including the palaeontologist Camille 
Arambourg, now remembered for defend- 
ing Neanderthals against accusations of 
simian brutishness. 

A demobilized botanist, André Guil- 
laumin, was searching for coal to heat the 
vast greenhouses. A major effort was under 
way to reorganize the collections, which 
were just starting to return, having been 
evacuated in 1939. Publication had been 
slowed but not stopped by the censors. At 
the Museum of Man, a resistance cell had 
been dismantled and its members executed 
or deported. 

Emerging from such tensions, the 
images take on extra significance. Dois- 
neau wrote later that he was struck by the 
contrast between the moment of history he 
inhabited — of which his growling stom- 
ach served as a constant reminder — and 
the geological epochs spanned by the col- 
lections. He used that contrast to powerful 
effect. 

Some of the images seem downright 
insolent, such as that of Paul Budker 
of the Laboratory of Fish and Colonial 
Animals gazing into a jar of baby sharks 
— a Frenchman inspecting imprisoned 
predators. Others find a wistful wisdom 
in scenes from the museums daily life. 
One such is The Funeral Procession of the 
Jaguar: the beast is pushed in a wheelbar- 
row over cobbles to the taxidermy depart- 
ment. Another, showing a white-coated 
woman with a wizened corpse on her hip 
and a faraway look in her eyes, Doisneau 
entitled The Surprising Lightness of a Peru- 
vian Mummy. 

Vox had envisaged a collection 
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> called The Face of Science. Overtaken 
by events, it never saw the light of day. In 
November 1942, Allied forces landed in 
North Africa, prompting the Germans to 
invade previously unoccupied southern 
France, rendering the Vichy government 
impotent. The photos were consigned to 
the museum library. 

In 1990, the museum invited Doisneau 
back to complete his project. This post- 
script was a good idea: the contrast 
between the two sets of photos speaks 
volumes. Doisneau was in his late sev- 
enties and famous. The museum, too, 
had changed, and Doisneau delighted in 
discovering its three new subterranean 
floors of storage. The later images are as 
closely observed as the earlier ones. But 
now — as in a picture of a stuffed gorilla 
ina lift, emerging from or descending into 
the museum's bowels — the irony is less 
loaded, and the delight floats free. m 


Laura Spinney is a writer and science 
journalist based in Paris. 
e-mail: Ifspinney@gmail.com 


Paul Budker of the Laboratory of Fish and Colonial Animals with a jar of baby sharks in 1943. 
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SCIENTIFIC METHOD 


Tales of the unexpected 


Stuart Firestein relishes Helga Nowotny’s study of uncertainty in science and society. 


or scientists, uncertainty is a norm. 
2 Experiments begin with uncertainty 

(why else do them?), and even when 
they are ‘successful’ the results contain 
only a range of certainty and a range of 
confidence about that certainty. Yet in the 
world outside the laboratory, uncertainty 
is perceived as negative — nota data point, 
but a failing, effectively no better than not 
knowing. 

Social scientist Helga Nowotny, former 
president of the European Research Coun- 
cil, has written The Cunning of Uncertainty, 
a tour of the phenomenon and its value — 
to the individual researcher, to the infra- 
structure of research and to society. Coping 
with uncertainty, Nowotny declares, must 
become a collective achievement. Otherwise, 
the scientific community risks becoming an 
elite, with all the suspicion and mistrust that 
that engenders. 

As Nowotny shows in numerous 
examples from the social sciences, his- 
torical literature and current media, soci- 
ety’s misunderstanding of uncertainty has 
already led to confusion, distortion and 
politicization of science (in the debates 
on tobacco and on climate change, for 
instance). Billions of dollars of public and 


private money are poured into research, 
largely on the false assumption that science 
provides cold, hard, immutable facts. This 
attitude is reinforced by an educational sys- 
tem that treats science as an immense ‘fact 
tract’ to be memorized (and then largely 
forgotten), producing a populace that 
believes science is about answers, rather 
than questions. 

Nowotny deserves high praise for bring- 
ing a discussion of the uncertainty around 
uncertainty into the public arena. She does 
so with remarkable aplomb given the sub- 
ject’s slipperiness. She challenges scientists 
to take a leading role in setting straight the 
distorted views of certainty in science, 
and then spreading 
the word to educa- 
tional and political 
policymakers. 

I was puzzled at 
first by Nowotny’s 
use of “cunning”, 
a term denoting a 
worrisome crafti- 


ness — cleverness +). Cunning of 
mixed with predatory Uncertainty 
wiliness. However, —HELGANOWOTNY 
Nowotny has nailed Polity: 2015. 
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it: uncertainty has a duality. It is a space 
that allows creativity, but it is fraught with 
insecurity. Nowotny herself struggles to dis- 
entangle uncertainty from the unexpected, 
the unpredictable, the dangerous. She some- 
times fails, but the exercise is instructive. 
We take too much for granted, she shows, 
in defining uncertainty; there are many 
subtleties and layers to it. From antiquity, 
people used prognosticators and magicians 
to look into the future; futurologists are still 
consulted. Yet people insist on ‘spoiler alerts 
before sports results are announced, or in 
film or book reviews; and few want to guess, 
much less know, the exact moment and 
circumstances of their too-certain deaths. 
Definitely cunning. 

Nowotny examines this tension through 
historical examples of how policy is set in 
areas of uncertainty. These are: reproductive 
technologies such as in vitro fertilization; 
stem-cell science; and personalized medi- 
cine. With the exception of the last example, 
these seem like old stories. A discussion of 
more current and contentious policies, such 
as genetically modified organisms or nuclear 
power, might have been more instructive. 
However, her analyses of big-data pro- 
grammes (the trumpeted then debunked 


Google Flu Trends is an excellent example) 
and nonlinear complex systems (such as 
finance, energy or pandemics) as gushers 
of uncertainty and contemporary spaces 
of innovation make for deep and fruitful 
reading. 

Although she gives us chapter headings 
like those ofa self-help book (‘Craving for 
certainty, “The odds for tomorrow’), Now- 

otny refuses to be 


“Treating ere ply _— ‘i 
science asa cia neheay al 
é ; ume. Rather, she 
‘fact tract’ to ; 
F presents us with 
be memorized id A 
duces many ideas an 
ee , numerous angles 
ro ie fen to chew over. My 
c at eneves personal favourite 
science is about 


is her tracking of 
answers, rather public attitudes to 


than ques tions.” risk, from viewing 

it as both nega- 
tive and positive to seeing it as an almost 
completely negative factor that must be 
reduced. Her ideas about positive risk and 
why it must be increased offer a refreshing 
perspective in this compliance-oriented 
world. 

This is, above all, a book of ideas, not 
a policy manual, even though Nowotny 
would obviously like to see changes in fund- 
ing, economic and management policies. 
She is well-equipped to lead us in this battle; 
throughout an illustrious academic career, 
she has served in numerous policymaking 
positions, for example in the European Sci- 
ence Foundation. And that is perhaps the 
one thing that I miss in this book: a more 
personal tour of uncertainty with a traveller 
who has come up against it in policy, fund- 
ing, education and research. It is a shame 
that Nowotny does not occasionally put 
down the careful scholarly pen and take up 
the memoirist’s. 

Asa Scientist, 1am at home with uncer- 
tainty. I like that the gathering of knowledge 
inevitably reveals new and unexpected bits 
of a vast unknown akin to the dark matter 
of epistemology. The real cunning of uncer- 
tainty lies in how it increases through every 
attempt to reduce it. 

As for certainty, wherever you find it, you 
can be sure that a demagogue or dictator is 
nearby. French author and Nobel laureate 
André Gide advised that we should believe 
those who seek the truth — and doubt those 
who claim to have found it. This is one 
prescription that Nowotny courageously 
follows. = 


Stuart Firestein is the former chair of 

the Department of Biological Sciences at 
Columbia University in New York City. His 
latest book is Failure: Why Science Is So 
Successful. 
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Books in brief 


Phishing for Phools: The Economics of Manipulation and 
Deception 

George A. Akerlof and Robert J. Shiller PRINCETON UNIV. PRESS (2015) 
In this acerbic dissection of free-market economics, Nobel- 
prizewinning economists George Akerlof and Robert Shiller trash 
the “invisible hand” theory, which claims that self-interest promotes 
social benefits. They reveal market economies as rife with trickery — 
“ohishing” luring “phools” to make poor choices. The two pool their 
economic wisdom to analyse arenas from food buying and politics 
to the financial crisis that has plagued us since 2008. A needed call 
for sceptical economics and financial mindfulness. 


On the Wing: Insects, Pterosaurs, Birds, Bats and the Evolution of 
Animal Flight 

David E. Alexander OXFORD UNIV. PRESS (2015) 

How do dragonflies, vultures or fruit bats fly? Biomechanics specialist 
David Alexander reveals all in this study of flight and the evolution of 
wings in pterodactyls, bats, birds and insects. Aloft with Alexander, 
we learn how pilots have observed phalanxes of swans flying at 
8,000 metres, and how the gargantuan pterosaur Quetzalcoatlus 
northropi probably got off the ground. Alexander analyses aerial 
predation, sex, combat, sleep and even egg-laying; picks at the 
puzzle of bats’ evolutionary relationships; and much more. 


American Zoo: A Sociological Safari 

David Grazian PRINCETON UNIV. PRESS (2015) 

Cultural sociologist David Grazian once studied urban nightlife. 
Turning to another brand of contained wildness, he immersed himself 
in zoos. As a volunteer, he clipped a ferret’s toenails, bathed tortoises 
and logged “more working hours of animal husbandry and faecal 
cleanup” than most professors can boast. His trek through 26 US zoos 
has yielded a powerful portrait of these conservation-hotspots-cum- 
living-labs — which end up telling us more about ourselves than about 
the animals. Peppered with delicious details, such as one zoo’s use of 
the film Austin Powers for animal “enrichment”. 


The Quotable Feynman 

Edited by Michelle Feynman PRINCETON UNIV. PRESS (2015) 

The Nobel-prizewinning, bongo-playing, exuberant and brilliant 
physicist Richard Feynman died in 1988. His contributions to 
science (including the theory of quantum electrodynamics) and 
science popularization ensure a lasting fame (R. Phillips Nature 504, 
30-31; 2013). His daughter Michelle has mined interviews, articles, 
books and lectures for this collection of quotes on everything from 
poetry to politics. Feynman’s depth and zing leap from the page, as 
in: “What! am trying to do is bring birth to clarity, which is really a 
half-assedly thought-out pictorial semi vision thing.” 


Cosmos: The Infographic Book of Space 

Stuart Lowe and Chris North AURUM (2015) 

Infographics remain on a roll, offering visual insight into abstruse 
regions of human knowledge. Astronomers Stuart Lowe and Chris 
North really lift off in this graphic exploration of all things space. 
Their depictions of year-by-year spaceflights and space junk 
shock through sheer numbers, while their takes on interplanetary 
missions, moons in the Solar System and particularly the 
polarization of the Milky Way — like a cosmic finger-painting in 
palest mauve — deliver the facts with aesthetic brio. Barbara Kiser 
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Correspondence 


Engage the public to 
stop bear trafficking 


Last month, authorities in China 
confiscated 100 kilograms of bear 
paws. Many of the animals had 
been massacred in Russia, then 
the paws smuggled into China. 
Neither country is managing to 
control this monstrous trade. 
Bear paws are a coveted 
delicacy in China, fetching up to 
ten times the price that they do 
in Russia. This is despite some 
species having a second-class 
protected status under Chinese 
law, with bear hunting and the 
sale and eating of bear products 
strictly prohibited in the country. 
The Chinese government must 
intensify its efforts against wildlife 
trafficking. Extensive publicity 
and public education will be the 
most effective way to kill demand 
for wildlife delicacies. 
Zhengrong Yuan, Yingying 
Han, Qiang Weng Beijing 
Forestry University, China 
giangweng@bjfu.edu.cn 


China Nobel stirs up 
attack on academies 


The first Nobel prize to be 
awarded to a researcher in 
China has sparked heated debate 
in the country (see Nature 
526, 174-175; 2015). This 
centres on the public's long- 
term dissatisfaction with the 
Chinese academy system, which 
consistently failed to recognize 
the scientific talent and originality 
of the prizewinner Youyou Tu. 
Chinese news reports about 
Tu’s award on 5 October drew 
attention to her ‘three-no’ status 
(that is, no overseas experience, 
no doctorate, no admission 
to a national academy — a 
status known as yuanshi). 
Citizens are indignant that 
her yuanshi application was 
repeatedly declined, despite her 
recommendation by the minister 
of health and her acclaim by 
international scientific bodies. 
The unfairness of the yuanshi 
selection process is widely blamed. 
on cronyism, nepotism and 


excessive bureaucracy (see, for 
example, C. Cao et al. Science 341, 
460-462; 2013). China’s academic 
institutions need to reform their 
moribund practices rapidly or 
they are likely to overlook other 
striking individuals and key 
grass-roots research in the future. 
Xin Miao Harbin Institute of 
Technology, Harbin, China. 
xin.miao@aliyun.com 


Badger-cull targets 
unlikely to reduce TB 


Two months ago, the 
government-advice body Natural 
England approved further 
licensed badger culls in parts of 
the United Kingdom in 2015. 
The aim is to reduce local badger 
densities by at least 70% to 
prevent the spread of tuberculosis 
(TB) to cattle (see go.nature. 
com/iiutvj). On the basis of the 
government's badger-population 
estimates, we calculate that these 
culls are unlikely to achieve the 
necessary reduction. 

The latest minimum cull 
numbers derive from the lower 
95% confidence bounds on 
population size estimates. For 
example, licensees in Dorset 
are required to kill at least 
615 badgers in a population 
estimated at 879-1,547 animals 
(95% confidence interval). Killing 
this number would give an 
estimated population reduction 
of between 39.8% and 70% 

(95% confidence interval). 

Equivalent confidence intervals 
for the 2015 (third annual) 
Somerset and Gloucestershire 
culls are, respectively, 50.8-70% 
and 54-70% relative to the 
baseline population estimates. It 
is therefore unlikely that a 70% or 
greater reduction can be attained 
by these minimum cull numbers, 
assuming that the population 
estimates are accurate. 

Evidence from a randomized 
controlled trial shows that better 
prospects for the control of 
cattle TB are offered by badger 
populations that are either 
reduced by more than 70% or left 
undisturbed — and potentially 
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vaccinated (C. A. Donnelly et al. 
Nature 439, 843-846; 2006). The 
choice depends on a range of 
epidemiological, economic, social 
and ecological factors. 

Christl A. Donnelly Imperial 
College London, UK. 

Rosie Woodroffe Institute of 
Zoology, London, UK. 
c.donnelly@imperial.ac.uk 
Competing financial interests 
declared; see go.nature.com/so3gvl. 


EU report advises on 
contentious research 


The United States has a de facto 
moratorium on genetic gain-of- 
function experiments that could 
increase the transmissibility 
or pathogenicity of potentially 
pandemic agents such as the 
H5N1 avian influenza virus. In 
Europe, opinion among scientists 
is divided on the benefits 
and risks of such research for 
policymakers. A new report on 
these differences by the European 
Academies Science Advisory 
Council (EASAC; see go.nature. 
com/jcdy2w) will help to inform 
scientists and the public on this 
globally controversial research. 
As well as risk—benefit 
assessment, the report addresses 
concerns such as scientific 
responsibility, research review and 
management systems, options 
for national and international 
biosafety and biosecurity advisory 
bodies, and the publication 
of sensitive information. It 
also highlights areas in which 
European Union regulations and 
best-practice guidelines need 
further consideration, notably 
those affecting the publication 
and export of research findings. 
We recommend an integrated 
approach to biorisk assessment 
and management, with 
responsibilities and action shared. 
among researchers, institutions 
and funders. 
Robin Fears, Volker ter Meulen 
EASAC, German National 
Academy of Sciences Leopoldina, 
Halle, Germany. 
volker.termeulen@mail.uni- 
wuerzburg.de 


Make raw emissions 
data public in China 


China's carbon emissions need 
to be estimated more accurately 
if the country is to meet its 
climate targets and participate 
in anationwide emissions- 
trading scheme after 2017 (see 
E Teng Nature 525, 455; 2015). 
Accounting errors will prevail 
until the nation’s data sources are 
made transparent, measurable 
and verifiable. 

For example, Teng points 
to discrepancies between our 
estimations of coal emissions 
(Z. Liu et al. Nature 524, 335-338; 
2015) and those calculated 
by the government's data 
agencies — from information 
that is not publicly available. The 
comparison is invalid, however. 
Our estimates are derived 
from energy consumption and 
measured emission factors for 
coal acquired from 4,845 mines, 
rather than from final energy 
consumption and estimated 
emission factors averaged over all 
coal types. 

Emissions estimates are even 
more uncertain in economies 
such as India, Indonesia and 
South Africa. China’s latest 
investment of 20 billion yuan 
(US$3.1 billion) for climate- 
change mitigation and adaptation 
in developing countries could 
help these nations to compile 
measurable, accurate and 
accessible carbon emissions data. 
Dabo Guan University of East 
Anglia, Norwich, UK. 

Zhu Liu Harvard University, 
Cambridge, Massachusetts, USA. 
Wei Wei Shanghai Advanced 
Research Institute, Chinese 
Academy of Sciences, Shanghai, 
China. 

dabo.guan@uea.ac.uk 


CORRECTION 

An editing error in the 
Correspondence by A.C. Lewis 
et al. (Nature 526, 195; 2015) 
attributed 29,000 UK deaths to 
particulate matter from diesel, 
rather than from all sources. 


BRIEF COMMUNICATIONS ARISING 


El Nino and intense tropical cyclones 


ARISING FROM F.-F. Jin, J. Boucharel & I.-I. Lin Nature 516, 82-85 (2014); doi:10.1038/nature13958 


The El Nino/Southern Oscillation (ENSO) influences global climate 
as well as extreme weather events such as floods, droughts, and trop- 
ical cyclones, leading to large societal impacts globally’~. Jin et al.* 
have shown that El Nifo—the warm phase of ENSO—effectively 
discharges oceanic heat into the central to eastern North Pacific basin 
through the subsurface ocean after its wintertime peak, resulting in 
high tropical cyclone activity during the following tropical cyclone 
peak season in the eastern North Pacific, which has significant impli- 
cations for seasonal prediction of tropical cyclone activity in the east- 
ern North Pacific. However, we question the robustness of their 
hypothesis on the following grounds: (1) the correlation between 
subsurface ocean heat delivered by El Nifio and tropical cyclone activ- 
ity is statistically exaggerated; and (2) wintertime ENSO conditions, 
which are claimed to have predictive value, are not strongly correlated 
with tropical cyclone activity during the subsequent summer. 
These factors imply that the hypothesis of ref. 4 has limitations in 
practical seasonal tropical cyclone prediction. There is a Reply to this 
Brief Communication Arising by Jin, F.-F., Boucharel, J. & Lin, I.-L 
Nature 526, http://dx.doi.org/10.1038/nature15547 (2015). 

Our first concern is the data smoothing used in their correlation 
analysis (figure 3 and supplementary figures 10 and 12 of ref. 4). In 
general, smoothing is a useful tool for removing high-frequency 
noises in order to focus on long-term variations, but smoothing can 
also tend to increase correlations artificially (http://climateaudit.org/ 
2008/02/12/data-smoothing-and-spurious-correlation/). Jin et al.* 
used a three-year smoothing, which is a suitable technique when 
the physical variations being examined are multiannual. However, 
the use of three-year smoothing is not appropriate in this case because 
ref. 4 examined interannual variations of tropical cyclone activity, 
focusing on interseasonal connections between wintertime ENSO 
and summertime tropical cyclones. It turns out that the smoothing 
significantly increased the correlation between subsurface ocean heat 
delivered by El Nino (based on the principal component of the second 
empirical orthogonal function mode in ref. 4, PC2) and tropical cyc- 
lone activity (accumulated cyclone energy, ACE*) from 0.29 to 0.62 
(see Fig. 1a and figure 3b of ref. 4). The smoothing also enhanced the 
correlation of a bilinear regression model of ref. 4 (see figure 3e of 
ref. 4) from 0.37 to 0.64 (not shown). 

A second concern is their way of comparing tropical cyclone activ- 
ity between high and low subsurface heat content (PC2 active) peri- 
ods. Jin et al.* showed a surprisingly distinct difference in the number 
of intense tropical cyclones between the two groups (see figure 2, 
tables 1 and 2, and supplementary figures 5-9 of ref. 4), but the results 
were overstated by unequal sampling. Jin et al.* sampled 43 months 
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PC2 summer average Nino3.4 winter average 


Figure 1 | Relationships of subsurface ocean heat delivered by El Nifio 
(PC2) and the ENSO signal (Niiio3.4) with tropical cyclone activity in the 
eastern North Pacific. a, Scatter plot of ACE against PC2 with smoothed (red) 
and unsmoothed (blue) data. b, Scatter plot of ACE against the Nifo3.4 
index. ACE and PC2 are averaged over the boreal summer and autumn (June to 
November) and Nifio3.4 is averaged over the ENSO peak period (December 
to February) before the summer tropical cyclone season. Linear regressions 
(straight lines) and correlation coefficients r are shown. All data sets are 
detrended, and are unsmoothed except for the smoothing discussed above. 
For consistent comparisons with ref. 4, we used the same analysis period 
(1970-2009), methods, and data as in ref. 4, except for the smoothing. The 
subsurface ocean heat over the upper 105 m (T105) represents all the existing 
subsurface ocean conditions in the eastern North Pacific, whereas PC2 explains 
only 56% of the T105 variations through a process delivered by El Nino. 
Therefore, PC2 is the index explaining the relationship between subsurface 
ocean heat delivered by El Nifo and tropical cyclone activity in ref. 4, although 
T105 is more closely related to tropical cyclone activity than PC2. 


for high-heat-content periods, but only sampled 25 months for low- 
heat-content periods. The monthly distributions of the sampled 
month were also different. For example, in the climatological lowest 
(June) and highest (August) tropical cyclone genesis months for the 
eastern North Pacific season, the numbers of selected months were 8 
(19% of total) and 7 (16%) for high-heat-content periods, but 8 (32%) 
and 2 (8%) for low-heat-content periods, respectively (Table 1). Using 
this unequal sampling, the total numbers of tropical cyclones are 
compared for the selected months of the two groups. This is an unfair 
comparison, leading naturally to a higher number of tropical cyclones 
in the high-heat-content periods. An impartial comparison should 
examine the differences in terms of mean values for each month 
rather than the total number of tropical cyclones. Our monthly 
comparison with mean values revealed that there are no significant 
differences in intense tropical cyclone numbers between the high- 


Table 1| Monthly and summer mean numbers of intense tropical cyclones during periods of low and high subsurface heat content 


Monthly mean numbers of intense tropical cyclones (number of selected months) 


Summer-mean intense tropical 


Periods Total number of 
selected months 
June July August 
High PC2 43 0.50 (8) 1.38 (8) 1.14 (7) 
Low PC2 25 0.25 (8) 1.14 (7) 1.00 (2) 
Climatology 0.31 0.86 0:90 
Confidence level for difference 67 27 8 


between periods (%) 


cyclone numbers 


September October November 
1.43 (7) 0.86 (7) 0.00 (6) 5.30 
1.00 (2) 1.00 (2) 0.00 (4) 4.39 
0.75 0.43 0.00 3.25 
SL 16 6) 67 


Intense (category 3, 4 and 5) tropical cyclones represent tropical cyclones at category 3 or above according to the Saffir-Simpson hurricane scale. Input data and methods are the same as in supplementary table 1 
of ref. 4 except that we used monthly mean values. Owing to the different monthly distributions in sampling between two periods, summer-mean intense tropical cyclone numbers are calculated by using 

the sum of monthly mean tropical cyclone numbers, not the total summer tropical cyclone number divided by the total number of selected months. This method has the advantage of not being influenced by the 
months without tropical cyclones (such as November). High (low) PC2 indicates a period of high (low) subsurface heat content, which is based on the principal component of the second empirical orthogonal 


function mode in ref. 4. 
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heat-content and low-heat-content periods based on PC2 (all confid- 
ence levels are less than 67%, Table 1). 

A third concern is the limitation in the seasonal prediction of 
tropical cyclone activity. Jin et al.* argued that observed ENSO signals 
(the Nifio index) in the winter are good indicators of tropical cyclone 
activity during the subsequent summer in the eastern North Pacific. 
However, the correlation between the Nifo index in the winter and 
ACE during the subsequent summer is very low (r = 0.18, Fig. 1b), 
which implies that the subsurface ocean heat delivered by El Nino has 
very little contribution (~3%) to the total variations of tropical cyc- 
lone activity in the subsequent summer. Consequently, ref. 4’s theory 
has limitations in practical seasonal tropical cyclone prediction. 

In spite of these concerns, we agree that the delivery of subsurface 
ocean heat for specific big El Nino events does have an influence on 
tropical cyclone activity in the eastern North Pacific. In these cases, 
there is no doubt that ocean thermal control can be used as a predictor 
for seasonal prediction of tropical cyclones in the eastern North 
Pacific. However, we consider that ref. 4’s claims of a direct connec- 
tion between the subsurface ocean thermal control delivered by El 
Nino and tropical cyclone activity are premature for general ENSO 
events. The connections are not robust enough to apply for the sea- 
sonal prediction for all types of ENSO events. An analysis classified 
according to the types of ENSO (El Nino or La Nina; cold tongue or 
warm pool El Nino®”) will be required to improve the correlations and 
reliability of prediction. 
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REPLYING TO I.-L. Moon, S.-H. Kim & C. Wang Nature 526, http://dx.doi.org/10.1038/nature15546 (2015) 


Observational and modelling studies suggest that subsurface ocean 
temperature plays a major part in tropical cyclone intensification’”. In 
a recent Letter’ we reported that through the El Nino/Southern 
Oscillation (ENSO) recharge-discharge mechanism, the subsurface 
heat of ENSO can directly affect intense tropical cyclones in the 
eastern Pacific®*. In the accompanying Comment”, Moon et al. ques- 
tioned the robustness and relevance of our results. 

First, ref. 5 questioned our use of a three-year running mean. 
However, this simple smoothing is actually physically relevant. It 
effectively filters out the ‘noise’ from warm pool El Nifo events’, 
which do not effectively discharge heat into the eastern Pacific and 
thus have little control on eastern Pacific tropical cyclone intensity’. 
By focusing on the slow and strong events, we delineated a clear 
physical mechanism: canonical ENSO events, which mostly have sig- 
nificant discharge of heat, exert a strong oceanic control on tropical 
cyclone activity in the eastern Pacific. However, we acknowledge that 
we should have explicitly indicated this point in ref. 3 to avoid con- 
fusion. Furthermore, it should be emphasized that although not every 
ENSO event delivers a great amount of heat to the eastern Pacific 
region, subsurface heat anomalies do have a strong impact on the 
overlying tropical cyclone activity. Using unfiltered data (that is, with- 
out the three-year running mean), a strong correlation (r= 0.62) 
between monthly subsurface ocean heat over the top 105m (T105) 
of depth and monthly tropical cyclone intensity is still evident (Fig. 1), 
clearly supporting a strong relationship between subsurface heat 
and the eastern Pacific tropical cyclone intensity. We also note that 
the quality of both tropical cyclone and ocean data sets is problematic 
in the earlier period (1959-1978), as there exist few subsurface 
observations®, and there were no systematic satellite tropical cyclone 
observations before the 1980s’. Thus, these low-quality data sets can 


contribute to lower correlations in the earlier period (Fig. 1b). In 
summary, we do not agree with the contention of ref. 5 that the impacts 
of subsurface thermal control on tropical cyclone activity are largely 
exaggerated owing to the smoothing regardless of the period. 

Second, ref. 5 questioned the statistical significance of tropical cyclone 
activity changes between periods of high and low subsurface heat over 
the eastern Pacific region. Our original choice (ref. 3) ofa broad temporal 
measure in terms of simple accounts of tropical cyclone number per 
decade displays significant differences between these periods (periods of 
high and low subsurface heat are referred to as high PC2/low PC2; see 
ref. 3 for details). Moon et al.* argue that this significance is severely 
degraded when their ‘accurate’ counting of total number per month is 
used. However, we believe that their counting ignored one important 
fact: there are many months without any tropical cyclone (hurricane) 
occurrence in the record. We argue that those ‘hurricane-empty’ 
months should be removed for a truly accurate counting. If we use the 
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Figure 1 | Relationships between heat content and tropical cyclone 
intensity in the northeast Pacific. a, Correlation based on the unsmoothed 
data from 1979-2009. b, Correlation based on the earlier period (1959-1978). 
c, Correlation based on the entire period (1959-2009). 
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Table 1| Changes in tropical cyclones statistics between high- and low-heat-content periods in the northeast Pacific 


Number of sampled month 


Mean of the number of category 3-5 tropical cyclones 


Confidence level (two-tailed) 


93% 


Confidence level (one-tailed) 


96.5% 


Number of months in sample 


34 3.368 (>99.75% for one-tailed; 
>99.5% for two-tailed) 


t value (confidence level) 


24 


a High PC2 29 1.34 
Low PC2 21 0.76 
Mean (%) Variance 
b High PC2 11.24 139.02 
Low PC2 3.21 38.29 

a, Comparison of intense (category 3-5) tropical cyclone count between high and low PC2 active periods, using category 1 as a minimum threshold. 


b, t-test result for the percentage of intense tropical cyclone grid count—that is, grid count of category 3-5 tropical cyclones divided by the grid count of total tropical cyclones (from tropical depression to 


category 5)—in the tropical cyclone region. 


conventional definition of tropical cyclone (category 1 or above) to filter 
out tropical depressions and storms that have little interactions with the 
subsurface ocean, and then remove all ‘hurricane-empty’ months from 
the counting, the number of tropical cyclones per month again passes 
the 95% confidence level (Table 1a). Furthermore, using a stricter grid- 
by-grid approach, a precise measure of intense tropical cyclone activity 
giving the percentage of intense tropical cyclones over all tropical cyc- 
lone grids, we obtained an even stronger statistical significance (above 
the 99.5% confidence levels, see Table 1b). Both the broad measure in 
our earlier analysis and the stricter measures we present here give 
consistent results regarding the significance of this statistical test. 

Third, Moon et al.* questioned the usefulness of our finding in the 
prediction of tropical cyclones. It is known (and expected to continue 
to be so) that a major El Nifo event is followed by a highly predictable 
heat discharge into the eastern Pacific. For instance, as this year’s El 
Nino develops into a strong event likely to peak in the coming boreal 
winter in the equatorial eastern Pacific, there is a high chance (75%) of 
a significant heat discharge into the eastern north Pacific to fuel 
intense tropical cyclones when next year’s tropical cyclone season 
arrives. Should a weak warm pool El Nifo event occur, it would 
discharge little heat to affect the eastern Pacific tropical cyclone activ- 
ity. Information about each El Nino and its heat content discharge is 
known long before the Northern Hemisphere tropical cyclone season, 
so it has a clear predictive value. 

In conclusion, we reaffirm the physical mechanism evidenced by 
ref. 3, that is, the impact of ocean heat content discharged from major 
El Nino events on the eastern Pacific tropical cyclone intensity and its 
value for tropical cyclone predictions beyond the seasonal timescale. 


Methods 


This work uses single-tailed significance while ref. 5 uses double-tailed. If the 
mean of the first group is expected to be larger than the second, a single-tailed 
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approach is used. However, if the mean of the first group is different from the 
second in either direction, a double-tailed approach is used. In the context of this 
study, we expect the group 1 (high PC2) mean to be larger than the group 2 (low 
PC2) mean, so we used a single-tail approach. In the revised Table 1, we present 
results using both approaches. 
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Caspase target drives pyroptosis 


Inflammatory caspase proteins help to control pathogen replication by triggering pyroptotic cell death. It now emerges 
that cleavage of the caspase substrate gasdermin D is sufficient to induce pyroptosis. SEE ARTICLES P.660 & P.666 


PETR BROZ 


form of cell death, are key responses of the 

innate immune system to pathogens. To 
avoid excessive damage to the host organism, 
pyroptosis is tightly regulated by the activation 
of inflammatory caspase proteins, such as cas- 
pase-1 and caspase-11 — but how these cas- 
pases initiate this cell-death program has been 
unknown. In this issue, Shi et al.’ (page 660) 
and Kayagaki et al.” (page 666) show that 
caspase-mediated cleavage of the protein 
gasdermin D creates an amino-terminal frag- 
ment that has intrinsic pyroptosis-inducing 
activity. 

Caspases are a family of enzymes best known 
for controlling apoptosis, a well-characterized 
type of cell death that is vital for embryonic 
development and tissue homeostasis in adults. 
A subgroup, comprising caspase-1, -11 and 
-12 in mice and caspase-1, -4, -5 and -12 in 
humans, controls the inflammatory response 
to pathogens and noxious stimuli in the cyto- 
plasm of host cells. These inflammatory cas- 
pases are produced as inactive monomers and 
become activated when recruited to multi- 
protein complexes knownas inflammasomes’. 
‘Canonical’ inflammasomes activate caspase- 1 
and are assembled by cytoplasmic sensors, 
such as the NOD-like receptors, which detect 
diverse pathogen- and host-derived danger 
signals. By contrast, the bacterial cell-wall 
component lipopolysaccharide (LPS), one of 
the strongest immune-system activators, leads 
to the assembly of ‘non-canonical’ inflamma- 
somes’ through direct binding and activation 
of caspase-4, -5 or -11. 

Although the signals leading to inflamma- 
some assembly are quite well understood, the 
downstream signalling events that follow the 
activation of inflammatory caspases are poorly 
characterized. Initial work identified the pro- 
inflammatory cytokine protein IL-1 as a key 
substrate of caspase- 1 in the canonical inflam- 
masome pathway. Subsequent studies discov- 
ered that inflammatory caspases also induce 
a programmed cell-death pathway, which 
involves cell swelling, lysis and the release of 
cytoplasmic content, presumably as a result 
of the formation of membrane pores, and is 
thus morphologically distinct from apoptosis. 


[fetes and pyroptosis, a specialized 
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Figure 1 | Gasdermin D cleavage initiates pyroptosis during inflammasome activation. a, The non- 
canonical inflammasome pathway detects bacterial lipopolysaccharide (LPS) molecules in the cytoplasm 
of infected cells. Direct binding of LPS to the protein pro-caspase-11 causes the protein’s dimerization, to 
become active caspase-11. In human cells, caspase-4 and -5 perform the function of mouse caspase-11. 
Shi et al.' and Kayagaki et al.’ show that caspase-11 cleaves the protein gasdermin D, releasing its amino- 
terminal domain from inhibition by its carboxy-terminal domain. The N-terminal domain (directly 

or indirectly) drives pyroptosis, a cell-death process that causes cell lysis and the release of cytoplasmic 
content. b, The canonical inflammasome pathway is triggered by different cytoplasmic sensor proteins, 
which detect pathogens and inflammatory agents. These recruit pro-caspase- 1 monomers through the 
adaptor protein ASC and activate the caspase by dimerization. The findings of the two groups suggest 
that caspase-1 can also initiate pyroptosis by cleaving gasdermin D, although other pyroptosis-inducing 
caspase-1 substrates may exist. Caspase-1 also processes the pro-inflammatory cytokine pro-IL-16 to 
generate mature IL-1f, which is presumably released by cell lysis during pyroptosis. 


Because this type of cell death is intrinsically 
pro-inflammatory, it was named pyroptosis, 
from the Greek pyro (fire or fever) and ptosis 
(to fall)°. Yet despite the identification of many 
potential caspase-1 substrates, the key mol- 
ecules involved in pyroptosis have remained 
a mystery. 

Now, Shi et al. and Kayagaki et al. have 
uncovered how inflammatory caspases induce 
pyroptosis. Taking advantage of the devel- 
opment of CRISPR/Cas9 genome-editing 
methods, Shi et al. used a genome-wide 
screen to identify host factors involved in cas- 
pase-11- and caspase-1-induced pyroptosis. 
Kayagaki et al. took a different tack, using mice 
subjected to random mutation (through 
ENU mutagenesis) to find mediators of cas- 
pase-11-dependent non-canonical inflam- 
masome signalling. Strikingly, both groups 
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identified the same gene, Gsdmd, which 
encodes gasdermin D, a member of the poorly 
characterized gasdermin protein family. They 
confirmed their screening results through sev- 
eral experimental approaches, demonstrating 
that gasdermin D is essential for pyroptosis 
induced by non-canonical inflammasome 
activation in mouse and human cells (Fig. 1a). 
Kayagaki et al. also show that Gsdmd deficiency 
reduces LPS-induced lethality in mice, which 
is known to depend on caspase-11-induced 
pyroptosis. 

Although both groups find that gasdermin D 
is essential for pyroptosis induced by caspase-4, 
-5 or -11, their results diverge on the require- 
ment of gasdermin D for pyroptosis resulting 
from caspase-1 activation (Fig. 1b). Con- 
firming the results of their CRISPR/Cas9 
screen that identified Gsdmd as an essential 


gene for caspase-1-induced pyroptosis, 
Shi and colleagues found that pyroptosis is 
abolished in Gsdmd-deficient macrophages 
(a type of immune cell) exposed to activators 
of canonical inflammasomes for short times. 
Kayagaki et al. present similar results; but 
after prolonged incubation, which may better 
reflect physiological processes, they no longer 
detected differences between the pyroptotic 
responses of wild-type and Gsdmd-deficient 
macrophages. For now, the jury is still out on 
whether gasdermin D is an essential mediator 
of caspase-1-induced pyroptosis; if it is not, 
then other pyroptosis-inducing caspase-1 
substrates must await discovery. 

Another enigma in the inflammasome field 
is the release of mature IL-1 after it has been 
cleaved by caspase-1. IL-1 lacks the amino- 
acid sequence that would direct it to the 
Golgi apparatus’, a cellular substructure that 
mediates protein secretion, so it is thought to 
be released by an unconventional secretion 
mechanism. Shi et al. show that, in Gsdmd- 
deficient macrophages, caspase-1 activity and 
IL-1 processing were not affected after canon- 
ical inflammasome activation, but secretion of 
the mature cytokine was completely abrogated. 
These results strongly support the hypothesis 
that IL-1 is released following pyroptotic lysis 
of the cell. 

How do inflammatory caspases activate 
gasdermin D to induce pyroptosis? Consistent 
with the previous identification of the protein 
as a caspase-1 substrate’, both groups noted 
that gasdermin D is cleaved by caspase-1 or 
caspase-11 at the aspartate 276 amino-acid 
residue following canonical or non-canonical 
inflammasome activation, and that muta- 
tion of this cleavage site completely abolished 
pyroptosis. The N-terminal domain that was 
left after cleavage was sufficient to induce cell 
death showing morphological features of pyro- 
ptosis, and the carboxy-terminal domain was 
found to bind the N-terminal domain if the 
former was overexpressed, blocking cell death. 
These results are consistent with the hypoth- 
esis that caspase-dependent cleavage releases 
the N-terminal domain of gasdermin D from 
an inhibitory interaction with its C-terminal 
domain, thereby unleashing its cell-killing 
properties. 

Finally, the studies highlight the role of 
other gasdermin family members in pyro- 
ptosis. In mice, gain-of-function mutations 
in Gsdma3 cause skin thickening and hair loss 
that is associated with chronic skin inflam- 
mation’. Shi et al. show that these mutations 
abrogate the interaction between the C- and 
N-terminal domains of gasdermin A3, and 
that the N-terminal domain of this protein also 
induces pyroptosis. Because only gasdermin D, 
and no other gasdermins, features a caspase 
cleavage site, further research is necessary to 
understand the context and mechanism of this 
seemingly similar activation process. 

The identification of gasdermin D as a 


key mediator of pyroptosis is a conceptual 
advance in our understanding of how inflam- 
matory caspases cause cell death. The studies 
present compelling evidence that caspase- 
induced cleavage of a single substrate, and 
not a spectrum of different substrates, is 
sufficient to induce pyroptosis and all the 
associated morphological features — simi- 
lar to the action reported for two substrates 
of apoptotic caspases, ATP11C (ref. 9) and 
ROCK! (refs 10, 11). Precisely how the gas- 
dermin D N-terminal domain exerts its cyto- 
toxic function should be addressed in future 
studies. Does it simply relay the signal to 
yet-unknown pyroptosis inducers, or might 
the fragment itself be an executioner of 
cell death, like the pseudokinase protein 
MLKL, for example, which drives necroptotic 
cell death by forming pores in the cell 
membrane’? Elucidating the mechanism of 
gasdermin D’s cytotoxic activity is likely to 
bring further exciting discoveries, and could 
pave the way for therapeutic approaches for 
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Random sudoku light 


A clever approach has been used to imprint a phase pattern on a laser beam. The 
pattern is not only random at each point, but also depends on information stored 


elsewhere in the pattern. 


TON] EICHELKRAUT & ALEXANDER SZAMEIT 


determines your options? Only the car 
directly in front of you: if it travels slowly, 
you drive slowly; ifit stops, so must you. How- 
ever, your decisions are independent of all 
other drivers’ decisions. This observation has 
profound physical consequences, because it 
leads to the distinc- 


[e= that you are stuck in traffic. What 


The authors tion between two 
imprinted on classes of random 
the laser beam physical system, 
aphase pattern knownas Markovian 
that corresponds — and non-Markovian 
to numerical after the math- 
solutions to ematician Andrey 
overlapping Markov’. Traffic 
sudokus. jams are Markovian 


systems, and, so far, 
this has also been the case for light that has 
a random spatial distribution in intensity or 
phase. Writing in Physical Review Letters, 
Fischer et al.* report the first experimental 
demonstration of non-Markovian light. 

The behaviour of each member of a 
Markovian system is determined only by its 
nearest neighbours. By contrast, each mem- 
ber of a non-Markovian system depends on 
the behaviour of a larger region, or even of the 


entire system. Light fields that have random 
spatial distributions of phase have so far been 
generated exclusively as the Markovian type, 
using processes such as scattering in diffusive 
optical media’. Fischer and colleagues use a 
clever phase-imprinting approach to form 
a light field that has a fully random phase at 
each spatial point and a local random nature 
that is determined not only by the nearest- 
neighbouring points, but also by regions 
that are farther away. 

Markov originally formulated his ground- 
breaking theory with dynamically evolving, 
random systems in mind. He formalized the 
treatment of an important feature of such sys- 
tems: how a system's ‘memory’ of its previous 
states affects its future states. Markov realized 
that, in a system that has no memory, future 
events can be predicted solely on the basis of 
the system’s present state: the defining property 
of a Markovian process is that the past and the 
future are mutually independent. Examples 
include the motion of dust floating in the air, a 
sequence of simple coin tosses or the PageRank 
algorithm that Google uses to search the 
Internet. The defining property of a Markovian 
process can be generalized for systems that 
have random spatial distributions, the traffic 
jam being one of the simplest examples. 

As an example ofa random non-Markovian 
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Figure 1 | Non-Markovian light. Fischer et al.’ used overlapping sudoku puzzles to imprint a spatially 
non-Markovian phase pattern on laser light using a device known as a spatial light modulator (SLM). 
The authors filled the cells in the upper left square of the SLM with phase information corresponding to 
the numbers of a correctly solved 9 x 9 sudoku (blue). They then shifted the puzzle frame down (or to the 
right) by six lattice cells, and filled it with a phase pattern corresponding to a sudoku solution for the new 
position, which incorporates the numbers from the bottom three rows (or rightmost three columns) of 
the first solution; new solutions are shown in pink. In this process, the number found in each sudoku cell 
depends not only on those in the nearest cells, but also on those in all other cells of the corresponding 
sudoku — that is, the pattern is non-Markovian. This is in contrast to the Markovian pattern of a fully 


random distribution of numbers (not shown). 


process, think of an urn that contains two red 
balls and one blue ball. On three consecutive 
days, one ball is drawn and not replaced. If yes- 
terday a red ball was drawn and today a blue 
ball is drawn, then it is clear that tomorrow a 
red ball will be drawn. We know this for certain 
because we have taken into account the events 
that have affected the urn’s state today and 
yesterday; that is, tomorrow’s draw depends 
not only on today’s draw, but also on 
past draws. 

To generate light that has a non-Markovian 
spatial distribution, Fischer et al. modified 
the transverse phase pattern of a laser beam 
using a device known as a spatial light modu- 
lator (SLM; an array of liquid-crystal cells 
akin to a computer screen). The authors then 
imprinted on the laser beam a particular phase 
pattern that corresponds to numerical solu- 
tions to overlapping sudoku puzzles; these 
were chosen because they are non-Markovian 
systems (Fig. 1). 

Fischer and colleagues started this process 
by filling the upper left square of the SLM 
array with phase information correspond- 
ing to the numbers of a correctly solved 9 x9 
sudoku. They then shifted the boundaries of 
the sudoku frame by a certain number of cells, 
either vertically or horizontally, and solved a 
new puzzle at the new grid position, adjust- 
ing the SLM’s phase accordingly. This proce- 
dure was repeated until the entire SLM array 
was filled with phase information that had a 
random, but non-Markovian, distribution. 


The phase imprint of light generated in 
this way is non-Markovian because the 
information contained in a given SLM 
cell depends not only on the contents of 
the nearest cells, but also on those of all 
other cells of the corresponding sudoku 
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frame. A remarkable aspect of the authors’ 
implementation of this process is that they 
used a tuning parameter to control the 
number of cells by which the sudoku frames 
can be shifted. Using this parameter, the 
authors investigated the transition between 
periodic and non-Markovian light, by tuning 
the imprinted phase pattern from one that 
corresponds to strictly periodic (largely over- 
lapping) sudokus to random sequences of 
independent puzzles. 

Fischer and co-workers have shown that 
light that has a non-Markovian spatial distri- 
bution can be generated in a highly flexible 
manner using a simple and fast set-up. These 
fascinating results pave the way to generating 
new types of ‘patterned light beam, in which 
properties such as intensity or polarization 
are used to store information. The authors’ 
technique might help scientists to achieve 
a deeper understanding of non-Markovian 
processes. Moreover, the ‘sudoku approach’ 
could serve as a general strategy for build- 
ing optical or other systems that have the 
singularly defining non-Markovian property, 
spatial memory. m 
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Pigments on the move 


In plant cells, the pigment anthocyanin is transported to a membrane-bounded 
organelle called the vacuole for storage. A previously unidentified transport 
pathway involving vacuolar-membrane extensions mediates this process. 


DIANE C. BASSHAM 


igment molecules called anthocyanins 
give plant cells a dark red or purple 


colour that attracts pollinating insects 
to flowers' and protects leaves against dam- 
age from ultraviolet light”. Anthocyanins are 
stored in intracellular structures called antho- 
cyanin vacuolar inclusions (AVIs) within a 
large organelle called the vacuole, the environ- 
ment of which protects the pigments from oxi- 
dation and subsequent loss of colour. There has 
been much debate’ * about how anthocyanins 
are transported to AVIs from their site of syn- 
thesis in another organelle, the endoplasmic 
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reticulum. Writing in The Plant Cell, Chanoca 
et al.° describe a previously unknown pathway 
of intracellular transport in which extensions 
of the membrane that surrounds the vacuole 
engulf and internalize anthocyanins. 

Several models for anthocyanin transport 
have previously been proposed. One model’ 
states that anthocyanins are carried directly 
across the vacuolar membrane by transport 
proteins, with the help of glutathione S-trans- 
ferase enzymes. A second’ proposes that 
anthocyanins first translocate into the interior 
of the endoplasmic reticulum, a small part of 
which then pinches off to form membrane- 
bounded vesicles that transport the pigment 


a Vacuole 


Cytoplasm 


Anthocyanin 
clump 


Figure 1 | Transport of plant pigments. a, In plant cells, pigment 
molecules called anthocyanins are synthesized in an organelle called the 
endoplasmic reticulum (ER). Chanoca et al.° propose a mechanism by which 
anthocyanins are transported to another organelle, the vacuole, for storage. 
b, In their model, clumps of anthocyanin move from the ER across the 


and expel it into the vacuole, analogous with 
the well-studied mechanism by which proteins 
are transported to the vacuole. 

A third model’ involves an autophagy-like 
pathway. Autophagy is the process by which 
unwanted materials are transported from the 
cytoplasm into the vacuole for degradation, 
and autophagy-related pathways can also 
transport materials for storage’. In one type of 
autophagy, dubbed macroautophagy, vesicles 
called autophagosomes, which have a dou- 
ble membrane, form around the cytoplasmic 
cargo. The outer autophagosome membrane 
fuses with the vacuolar membrane, deposit- 
ing the inner membrane and the enclosed 
cargo into the vacuolar interior. In this model, 
the engulfment of part of the endoplasmic 
reticulum and its associated anthocyanins 
by autophagosomes leads to their deposition 
inside the vacuole. 

Chanoca et al. attempted to distinguish 
between these models using fluorescence 
microscopy to study cells harbouring a 
fluorescent dye that labels membranes, and 
transmission electron microscopy to study 
the process at high resolution. They assessed 
the structure of AVIs in leaf cells from the 
model plant Arabidopsis thaliana and in cells 
from the purple petals of Eustoma grandiorum 
(commonly known as lisianthus). The first 
two models of anthocyanin transport predict 
that AVIs would not be bounded by mem- 
branes, whereas the third model predicts that 
the AVIs would have two membranes — one 
derived from the autophagosome inner mem- 
brane and the other from the engulfed endo- 
plasmic reticulum. Surprisingly, the authors 
found that the AVIs were surrounded by a 
single membrane, compelling them to rethink 
current models. 

To explain their findings, Chanoca and 
colleagues again turned to microscopy to 
analyse plant cells that produce a fluorescent 
protein that labels the vacuolar membrane. 
This strategy allowed the authors to recon- 
struct the events leading to the vacuolar uptake 
of anthocyanins. They observed that clumps 


of anthocyanin in the cytoplasm outside the 
endoplasmic reticulum initially become tightly 
pressed against the vacuolar membrane, 
which puts out extensions that wrap around 
the anthocyanins. These extensions finally 
fuse, releasing anthocyanins surrounded by 
a single, vacuole-derived membrane into the 
vacuole (Fig. 1). This mechanism is remi- 
niscent of a process called microautophagy 
— in which the vacuole engulfs cytoplasmic 
material directly — rather than of the indirect 
macroautophagy route. 

Although there has been speculation that 
microautophagy occurs in plants*, mechani- 
stic information is lacking. In other organisms, 
microautophagy, macroautophagy and simi- 
lar processes typically depend on a group of 
core autophagy (ATG) genes’. To assess the 
relationship between anthocyanin uptake 
and known autophagy pathways, the authors 
analysed the formation of AVIs in cells that 
do not express the gene ATG5, and in cells 
treated with a drug that blocks autophagy. 
AVIs formed normally in both cases, indicat- 
ing that, despite the morphological similari- 
ties, anthocyanin uptake and autophagy are 
distinct at the molecular level. The antho- 
cyanin-uptake pathway therefore seems to bea 
previously unidentified type of mechanism for 
vacuolar uptake of material. It should be noted, 
however, that this finding does not discredit 
the other models of anthocyanin transport, 
because there could be more than one uptake 
mechanism at work. 

The molecular pathways that direct defor- 
mation of the vacuolar membrane and so 
promote anthocyanin engulfment remain to 
be determined. It is possible that a subset of 
ATG proteins not studied by Chanoca et al. is 
required, and analysis of other mutant plants 
could address this issue. However, it seems 
more likely that a different mechanism is 
responsible for the generation of membrane 
protrusions from the vacuole, potentially 
involving proteins that induce membrane 
curvature. 

How anthocyanins are selected for 
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cytoplasm and become tightly pressed against the vacuole. c, The membrane 
that surrounds the vacuole extends around the clump. d, The membrane then 
fuses, creating a single-layered membrane structure around the pigment. 

e, This structure, called an anthocyanin vacuolar inclusion (AVI), is then 
released into the vacuole. 


engulfment over other nearby cytoplasmic 
components is also an open question. One 
possibility is that an anthocyanin-binding pro- 
tein on the vacuolar membrane concentrates 
the pigment at sites of uptake. Alternatively, 
given that some evidence” suggests that antho- 
cyanins and related molecules bind to lipids, 
the pigments might bind directly to the lipid 
bilayer that makes up the membrane, possi- 
bly even causing membrane curvature at the 
binding site. 

Other anthocyanin-related molecules — 
including tannins, which protect plants from 
being eaten, and possible photoprotectants 
called coumarin glucosides — also accumulate 
in vacuolar inclusions, and the mechanisms 
by which they are taken up by the vacuole are 
unknown. It could be, then, that Chanoca and 
colleagues have uncovered a widespread path- 
way for the transport of pigments and related 
molecules into the vacuole. Their pathway 
might yet prove to be crucial beyond its role 
in pigment accumulation, for plant protection 
against herbivores and pests, and for abiotic 
stress tolerance. = 
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Foraging 
further 


King penguins on the Crozet archipelago 
in the southern Indian Ocean travel south 
to forage for food around the Antarctic 
Polar Front, where cold Antarctic waters 
meet warmer sub-Antarctic seas (pictured, 
a king penguin diving). Writing in Nature 
Communications, Bost et al. report that 
climatic variability can alter the birds’ 
foraging behaviour and population dynamics 
(C. A. Bost et al. Nature Commun. http:// 
dx.doi.org/10.1038/ncomms9220; 2015). 
By tracking king penguins (Aptenodytes 
patagonicus) for 16 years, Bost et al. found 
that changes associated with an increased 
sea surface temperature of just 1°C pushed 
the polar front southward, and increased 
both the distances penguins travelled to 
forage and the depths to which they dived 
for food. After large-scale climatic anomalies, 
their population size also fell. Climate models 
predict that the front will continue to shift 
southward, which may threaten penguins 
and their prey. Jennifer R. Gardiner 
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Protein modification 


in atrice 


Organometallic reagents have been developed that chemically modify proteins 
and peptides specifically at cysteine amino-acid residues — potently offering a 
general route to making therapeutically useful compounds. SEE LETTER P.68° 


HEATHER MAYNARD 


roteins are valuable therapeutic and 
Pp imaging agents, but they often need to 
be chemically modified to work opti- 
mally in these roles. If a protein is to remain 
fully biologically active after a chemical group 
has been attached, the modification must 
often take place at a specific, predetermined 
amino-acid side chain. On page 687 of this 
issue, Vinogradova et al.’ report palladium- 
containing organometallic reagents that not 
only modify proteins and peptides at selective 
sites, but also do so quickly and without being 
heated, to generate high yields of products. 
Chemically modified proteins can have 
several advantages over their naturally occur- 
ring counterparts. For example, attaching 
polymers to a protein can increase the time 
that the protein spends in the body, leading to 
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fewer injections for patients~*. Proteins that 
target certain cancerous tissues can be visual- 
ized in vivo if radioactive labels are attached, 
facilitating identification of prognosis and 
treatment monitoring’. And anticancer drugs 
attached to protein antibodies can be as effec- 
tive as the unmodified drugs, but have fewer 
side effects’. 

Therapeutic proteins have conventionally 
been modified non-selectively, but this some- 
times leads to large reductions in biological 
activity — as much as 93% when polymers are 
attached®. However, site-specific modifica- 
tions can be crucial for the modified product 
to retain full biological activity. Many organic 
reagents have been used to modify proteins at 
specific sites, with varying success. By contrast, 
the use of organometallic reagents has lagged 
behind that of their wholly organic counter- 
parts because of the difficulty of finding 
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reaction conditions that are compatible with 
the use of both proteins and organometallic 
reagents. Furthermore, it is hard to achieve 
selectivity in the presence of the many reactive 
chemical groups found in proteins’. 

One approach to achieving site selectivity is 
to exploit either naturally occurring or non- 
natural amino acids that have unique reac- 
tivity, by judiciously placing them at protein 
sites at which the attachment of a chemical 
group will not alter the protein’s function or 
structure. Cysteine is commonly used for this 
purpose, because the thiol group (SH) in this 
natural amino acid’s side chain can be targeted 
selectively depending on the reagent or the pH 
used in the modification reaction. Many pro- 
teins do not have ‘free’ cysteines that contain 
thiols, because these groups often react with 
other cysteine residues in the same protein to 
form disulfide bonds (S-S). But modern bio- 
chemical techniques now allowa free cysteine 
to be placed in any position in a protein’s 
amino-acid sequence, thereby pre-directing 
the site of attachment. 

Organometallic approaches to modify- 
ing proteins’”* have most often targeted non- 
natural amino-acid residues or certain natural 
ones (lysine, tyrosine or tryptophan residues), 
but rarely free cysteines. In Vinogradova and 
colleagues’ method, the organometallic rea- 
gent reacts specifically with the thiol groups 
of free cysteines across a broad pH range 
(5.5-8.5). The reactions are complete within 
minutes, and can be performed using low 
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Figure 1 | Site-selective reactions with proteins. Vinogradova et al.' describe organometallic reagents 
that react selectively with thiol groups (SH) in the side chains of cysteine amino-acid residues in proteins. 
Here, a thiol group in an antibody protein (blue) reacts with a palladium complex that contains the 
anticancer drug vandetanib (red), to produce an antibody—vandetanib conjugate; on average, 4.4 thiols 
in the antibody were modified. The reactions proceed in high yields, at room temperature, to form stable 


bonds within minutes. 


concentrations (micromolar concentrations 
or less) of a protein or peptide — a useful 
attribute because these biomolecules are often 
available in only small quantities. Remarkably, 
the reaction yields are normally 100%, and the 
resulting bonds are chemically robust. 

The authors report that their reactions work 
in aqueous solutions (which are required for 
proteins), with the addition ofa small amount 
of organic solvent. The transformations also 
work in air, allowing them to be set up easily 
on the bench top rather than under an inert 
atmosphere, as is often needed for organo- 
metallic reactions. 

The palladium reagents used are simple to 
prepare from a range of readily available pre- 
cursors (aryl halides and trifluoromethane- 
sulfonates). Although the reagents must be 
synthesized under an inert atmosphere, once 
made, most can be stored in refrigerated vials 
for months without degradation, in much the 
same way as proteins. This might therefore 
make it possible for kits of various palladium 
reagents to be made available to researchers 
who do not have access to the chemicals or 
equipment needed to make the reagents. 

Vinogradova et al. went on to show that the 
organometallic reagents can be used to pro- 
duce therapeutically interesting compounds 
known as staple peptides and antibody-drug 
conjugates (ADCs). In staple peptides, small- 
molecule linkers bridge two amino-acid resi- 
dues to reduce the peptide’s flexibility. The 
resulting molecules can have higher affinity for 
a biological target than non-stapled analogues, 
or be less susceptible to breakdown by enzymes 
in the body’. The authors used their approach 
to staple a peptide in less than 10 minutes and 
with 100% yield. 

ADCs* consist of small-molecule drugs 
attached to antibodies; two ADCs have 
been clinically approved for treating cer- 
tain forms of cancer. Such drugs hold the 


promise of high efficacy with fewer side 
effects than small-molecule anticancer drugs 
because they specifically target the malignant 
tissue. Clinically approved ADCs are prepared 
using ‘maleimide-thiol’ reactions to connect 
antibodies to the drugs, but these reactions 
are reversible in vivo unless the chemistry of 
the resulting linkage is carefully designed to 
prevent this’°. Vinogradova and co-workers 
used their new reaction to attach the anti- 
cancer drug vandetanib to free cysteine 
residues of an antibody (Fig. 1). 

An advantage of the authors’ method is 
that the reaction product is more stable than 
analogues formed using maleimide-thiol 
reactions. In tests with one of the products, 
83-90% of the peptide remained intact after 
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treatment with acids, bases and external thiol 
compounds. Thus, the method might pro- 
vide a new way to prepare ADCs, as well as 
other compounds for which stable linkages to 
proteins are desirable. 

The antibodies and staple peptides used in 
Vinogradova and colleagues’ work are fairly 
stable. It remains to be seen whether more- 
sensitive proteins that unfold easily can also 
be used. In addition, the amount of palladium 
used in the reactions will need to be reduced 
to low levels, so that unacceptable amounts 
of the metal are not left as impurities in pro- 
teins prepared for medical applications. The 
authors partly addressed this point by demon- 
strating that 91-94% of the palladium could be 
removed from the product. If these two issues 
can be resolved, then the possibilities for this 
powerful technology are exciting indeed. m 
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Homo sapiens in China 
80,000 years ago 


A discovery in southern China of human teeth dated to more than 80,000 years 
old indicates that Homo sapiens was present in the region considerably earlier 
than had previously been suspected. SEE LETTER P.696 


ROBIN DENNELL 


ebate over when our species, Homo 
D sapiens, first dispersed from Africa 

across southern Asia is hindered by 
a lack of relevant fossil evidence between the 
eastern Mediterranean and southeast Asia. 
Exciting new material is presented in this 
issue by Liu et al.' (page 696), who describe 
a collection of H. sapiens teeth from a cave 


in southern China’s Hunan province. The 
age and morphology of the teeth suggest that 
modern humans reached southern China 
long before they had arrived in northern 
China or in Europe. 

Most researchers agree that our species 
first appeared in East Africa around 190,000 
to 160,000 years ago, and then dispersed into 
the eastern Mediterranean around 100,000 to 
60,000 years ago, after which it was replaced by 
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Figure 1 | Human dispersal. Liu and colleagues’ discovery! of modern-human teeth at Fuyan Cave 
in southern China implies that Homo sapiens dispersed towards the east and south from Arabia or the 
eastern Mediterranean between 120,000 and 80,000 years ago, considerably earlier than the species 

is known to have first arrived in Europe and northern China, around 40,000 years ago. The delayed 
dispersal to these northern regions may be explained by the well-established resident Neanderthal 
populations in Europe and Siberia, and the harsh, sub-zero winter conditions. 


Neanderthals. Following this apparent ‘failed 
dispersal’, one suggested scenario is that 
H. sapiens did not progress eastwards until 
around 60,000 years ago — a date based on the 
estimated divergence time of genetic lineages 
in South Asian people’ and the probable arrival 
date of humans in Australia*. This picture is 
supported by apparent similarities between 
small stone tools known as microliths from 
South Africa dating to around 60,000 years 
ago and ones from South Asia that are between 
30,000 and 36,000 years old*”. These similari- 
ties have been interpreted as showing a direct 
link between the earliest modern humans in 
South Asia and their probable African ances- 
tors’, although the similarities now seem less 
robust than was first argued®. A contrasting 
scenario is that H. sapiens first dispersed east- 
wards (probably by way of the Arabian Penin- 
sula) during the last interglacial period, and 
entered South Asia well before 60,000 years 
ago, and perhaps before the volcanic super- 
eruption that occurred 74,000 years ago at 
present-day Lake Toba in Indonesia”. 

One place where these hypotheses can be 
tested is in southern China, which is dotted 
with karst caves that are rich fossil sources. 
But descriptions of this material have so far 
been ambiguous because of doubts over the 
stratigraphic context of skeletal specimens, 
their dating and/or their identification. Some 
finds were made by villagers while digging for 
fertilizer, which obscured the stratigraphic 
context of the fossils; in other cases, fossil 
teeth are too worn for identification purposes, 
or the association between a dated flowstone 
deposit and a skeletal specimen is unclear’. 
At Zhiren Cave in southern China, for exam- 
ple, a mandible (lower jawbone) attributed to 
H. sapiens was found in a geological sequence 
dated by five flowstones ranging from 110,000 
to 55,000 years old!!, so the specimen may be 
younger than the published age of 110,000 
years’; it has also been suggested that the man- 
dible is from a late Homo erectus individual”. 


Liu and colleagues’ discoveries at Fuyan 
Cave are especially welcome because they 
seem to lack these usual problems sur- 
rounding context, dating and identification. 
A flowstone (layer 1) covers the entire cave 
floor, so the underlying material clearly has 
to be older; a stalagmite on this flowstone 
was dated to a minimum age of 80,100 years 
old (+ 1,200 years). The underlying layer 2 is 
a thin, sandy clay around 20-50 centimetres 
thick, in which the authors identify numerous 
mammalian fossils from 38 extant species and 
5 extinct large mammals. This fauna is iden- 
tified as being from the Upper Pleistocene 
period (125,000 to 10,000 years ago), so the 
human teeth, which were found in layer 2 in 
part of the cave, are estimated to be between 
80,000 and 120,000 years old. 

The teeth are well preserved and show 
detailed cusp morphology. All 47 are 
unequivocally attributed to H. sapiens. The 
authors describe them as generally smaller than 
other African and Asian specimens from the 
Upper Pleistocene, and closer in size to those 
of Upper Pleistocene Europeans and contem- 
porary modern humans. The M’ molar teeth 
in the sample are different in shape from the 
rhomboidal contours displayed by Neander- 
thals or the elongated teeth seen in Asian 
H. erectus fossils. Instead, the relative cusp 
and occlusal polygon areas of the M’ molars 
are almost identical to those of modern 
Chinese populations. They also seem less 
primitive than northern Chinese specimens 
suchas those discovered at the Xujiayao site™*. 

This discovery has several implications. The 
finding that H. sapiens first appeared in south- 
ern China between 120,000 and 80,000 years 
ago indicates that our species dispersed across 
southern Asia well before 60,000 years ago 
(Fig. 1). Furthermore, the fact that the teeth 
resemble those of Upper Pleistocene Euro- 
peans and modern humans implies that the 
population they came from were immigrants 
and not the outcome of local evolution from 
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H. erectus. To place these finds in their conti- 
nental context, the Fuyan teeth indicate that 
modern humans were present in southern 
China 30,000 to 60,000 years earlier than in 
the eastern Mediterranean and Europe. This is 
not surprising, perhaps. H. sapiens originated 
in or near the tropics, so it makes sense that the 
species’ initial dispersal was eastwards rather 
than northwards, where winter temperatures 
rapidly fell below freezing. As Liu and col- 
leagues point out, the finds also imply that 
modern humans were in southern China long 
before there is evidence for them in northern 
China and Europe. In the case of northern 
China, the earliest evidence is from Tianyuan 
Cave near Beijing, dated to around 40,000 years 
ago"; in Europe, evidence appears around 
45,000 to 40,000 years ago'®. Here, the authors 
suggest, the presence of Neanderthals may 
have delayed the arrival of modern humans. 
However, the predominantly colder winter 
conditions of the enormous landmass between 
Europe and northern China may better explain 
the earlier colonization of southern zones. 

Excavation of other caves in the region will 
undoubtedly add to the findings from Fuyan. 
Whatis especially needed nowis archaeological 
evidence (sadly lacking in Fuyan Cave) to 
indicate whether the initial dispersal of our 
species was caused or facilitated by cognitive 
developments (such as symbolism or complex 
exchange systems), or was simply an example 
of opportunistic range extension. More revela- 
tions about our species’ history can surely be 
expected from southern China. = 
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QUANTUM PHYSICS 


Death by experiment 
for local realism 


A fundamental scientific assumption called local realism conflicts with certain 
predictions of quantum mechanics. Those predictions have now been verified, 
with none of the loopholes that have compromised earlier tests. SEE LETTER P.682 


HOWARD WISEMAN 


he world is made up of real stuff, 

| existing in space and changing only 
through local interactions — this local- 
realism hypothesis is about the most intuitive 
scientific postulate imaginable. But quantum 
mechanics implies that it is false, as has been 
known for more than 50 years’. However, bril- 
liantly successful though quantum mechan- 
ics has been, it is still only a theory, and no 
definitive experiment has disproved the local- 
realism hypothesis — until now. On page 682 
of this issue, Hensen et al.” report the first 
violation of a constraint called a Bell inequal- 
ity, under conditions that prevent alternative 


Juanita 


Electron 
Entanglement 


explanations of the experimental data. Their 
findings therefore rigorously reject local 
realism, for the first time. 

Bell inequalities are named after John Bell, 
the physicist who discovered in 1964 that 
the predictions of quantum mechanics are 
incompatible with the local-realism hypoth- 
esis’. There are many different ways to make 
this hypothesis precise’, but Hensen and col- 
leagues’ exposition basically follows Bell’s 
original formulation, which states it as the 
conjunction of two other hypotheses: realism 
(which Bell called predetermination), essen- 
tially meaning that measurements reveal pre- 
existing physical properties of the world; and 
locality, roughly meaning that any change 


Figure 1 | Violation of a three-party Bell inequality. A Bell inequality is a mathematical relationship 
regarding the statistics of measurement outcomes obtained by two or more parties. Under certain physical 
conditions relating to the timing of events, a violation of a Bell inequality proves that local realism — a 
hypothesis satisfied in all of science except quantum mechanics — is false. Hensen et al.” have violated a 
Bell inequality in such a way that the requisite physical conditions were satisfied for the first time, using 
the scheme shown in this cartoon. a, At separate locations, Alice and Bob create entangled states of an 
electron and a photon, then send the photons to Juanita’s laboratory. b, Alice and Bob randomly choose 

a setting for measurements of their respective electrons. c, They obtain their measurement outcomes, 
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50 Years Ago 


It may not be generally realized 

that work is in progress on the 
colossal project of constructing a 
40-in. diameter, 300 miles long, 
Trans-Alpine oil pipeline to convey 
oil from the Adriatic to the heart 

of Germany ... Among the many 
practical problems concerned with 
such a project, apart from tunnelling 
and mechanical excavation in the 
high Alps, are the necessity to dredge 
the harbour at Trieste so that it can 
eventually accommodate oil tankers 
of 160,000 dead weight tons; setting 
storage tanks there on piles because 
available land is a rocky hill site; 
construction of several thousand feet 
of piers in the Adriatic ... Involved 
also in the scheme is the building of 
five separate pumping stations, each 
equipped with two 4,000-horse- 
power electric centrifugal pumps 
required to lift hundreds of 
thousands of tons of oil from 
sea-level to one of the highest 

points of Felber Tauern. 

From Nature 30 October 1965 


100 Years Ago 


‘Distances at which sounds of heavy 
gun-firing are heard’ — Referring to 
the correspondence on this subject, 
Ihave been collecting information 
as to places at which the sound of the 
firing in Belgium has been heard in 
this country ... Here, at a distance 
of about 125 miles from Ypres 
(taking that town for convenience, 
as a known centre) I have heard 
firing quite unmistakably since the 
beginning of the war — often all day, 
and for many days in succession, 
and frequently at night too. So far 

as I have been able to ascertain, 

the greatest distance from Ypres 

at which the firing has been heard 
unmistakably is about 140 miles ... 
Observations seem to show that the 
direction of the wind has less to do 
with the transmission of the sound 


and Juanita performs a joint measurement of the photons. Alice’s and Bob's outcomes are purely random 
unless Juanita gets a rare successful outcome (as shown here) that indicates entanglement between Alice’s 
and Bobs electrons. By collating the results over many runs, Hensen et al. showed that a Bell inequality 
had been violated by a statistically significant amount. 


than certain atmospheric conditions. 
From Nature 28 October 1915 
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enacted at one place cannot have an effect at 
another place unless there would have been 
time for a light signal to get from the first place 
to the second place. The speed of light is rel- 
evant because, according to Einstein's theory of 
relativity, no causal influence can travel faster 
than light. 

A Bell inequality is a mathematical relation- 
ship regarding the statistics of measurement 
outcomes obtained by two or more parties, 
and also involving the measurement settings 
chosen by those parties. Suppose that the par- 
ties are in well-separated laboratories, and 
that the measurement settings are chosen and 
implemented, and the outcomes obtained, in 
a sufficiently short time that the only way the 
choice of setting by any party could affect the 
outcome of any other party would be through 
a faster-than-light influence. Then, by defini- 
tion, all Bell inequalities will be satisfied by all 
local-realistic theories. An experiment vio- 
lating a Bell inequality therefore implies that 
either locality or realism is false. Bell’s theorem 
is that, according to quantum mechanics, such 
an experiment is possible if the parties share 
particles prepared ina suitable entangled state. 
Entanglement is a holistic property of a sys- 
tem of quantum particles that can persist even 
when the particles are far apart. 

Bell inequalities have been violated experi- 
mentally many times before*”. However, all 
of these experiments had loopholes. Either 
the parties were not far enough apart, given 
how long it took for the processes involved 
(randomly choosing a setting, adjusting the 
apparatus appropriately and obtaining an 
outcome), or the measurements were inef- 
ficient, so that quite often no outcome at all 
was registered. The inefficiency is relevant 
because it can allow the existence of local 
realistic theories — albeit highly contrived 
ones — that exploit the existence of null 
outcomes to simulate the correlations of 
quantum mechanics. 

Several groups worldwide have been 
racing to perform the first Bell experiment 
that combines large separation, efficient 
detection and fast operation of the apparatus. 
Hensen et al. have won the race by using a new 
scheme. Previously, the leading approach was 
to prepare an entangled state of two photons, 
send one to one laboratory — conventionally 
called Alice's — and the other to a second 
laboratory, Bob’s. By contrast, Hensen and 
colleagues’ experiment should be regarded 
as using a three-party Bell inequality. 

In this three-party approach (Fig. 1), Alice 
and Bob each prepare an entangled state of a 
photon and an electron, keep their electrons 
in a diamond lattice and send their photons 
to Juanita, as I'll call her. Alice and Bob then 
each choose a setting and measure their elec- 
trons, which can be done efficiently, while 
Juanita performs a joint measurement on the 
two photons. Alice's and Bob's outcomes will 
be purely random unless Juanita gets a rare 


‘successful result, in which case the outcomes 
indicate entanglement between Alice’s and 
Bobs electrons. Unlike Alice and Bob, Juanita 
always makes the same measurement, and so 
its inefficiency does not open a loophole. 

Hensen and co-workers’ experiment was 
made possible only by combining state-of-the 
art quantum technologies — it was performed 
in the Netherlands, but used diamond sub- 
strates prepared in the United Kingdom and 
fast random-number generators developed in 
Spain. Maintaining optimal operation of all the 
devices during the experiments was extremely 
challenging, and the rate of events (defined 
as Juanita getting a successful outcome) was 
only about one per hour. As a consequence, 
only 245 such events were recorded, and the 
statistical uncertainty in the reported Bell- 
inequality violation is comparatively large. 
Nevertheless, from a careful analysis of the 
entire data set, including runs in which Juan- 
ita did not get the desired outcome, Hensen 
et al. reject the local-realism null hypothesis at 
a confidence level conventionally considered 
to be statistically significant. It is to be hoped 
that more data will be generated soon. 

The authors’ approach might allow them to 
implement quantum-information protocols 
enabling secure communication, even when 
the devices used are not trusted by the users. 
For this to be practical, the event rate would 
have to be massively increased above its cur- 
rent level. However, the basic technology and 


NON-CODING RNA 


the scheme (involving joint measurements by 
the intermediary Juanita) are promising. 

The immediate significance of the reported 
experiment, however, is in hammering the 
final nail in the coffin of local realism. Some 
almost metaphysical loopholes remain open — 
if the results can be replicated with humans, 
rather than machines, freely choosing the 
measurement settings and consciously reg- 
istering the outcomes, then the coffin will 
have been interred and buried. That experi- 
ment, however, is for many years hence. For 
the moment, we should celebrate Hensen and 
colleagues’ landmark achievement in physics. m 
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Antibiotic tricks 


a switch 


Ascreen for compounds that block a bacterial biosynthetic pathway has 
uncovered an antibiotic lead that shuts off pathogen growth by targeting a 
molecular switch in a regulatory RNA structure. SEE ARTICLE P.672 


THOMAS HERMANN 


he golden age of antibiotic discovery, 

between 1940 and 1960, was heralded 

by the work of Selman Waksman. A bio- 
chemist and microbiologist, Waksman coined 
the term ‘antibiotic’ and was the first to use 
systematic screening to discover antibacterial 
leads’. Efforts in his laboratory yielded more 
than 20 natural antibiotics — most notably 
streptomycin in 1943, the first effective treat- 
ment for tuberculosis”. The discovery won 
Waksman the 1952 Nobel Prize in Physiology 
or Medicine. Waksman’s research caught the 
attention of scientists at the US pharmaceu- 
tical company Merck, and the ensuing col- 
laboration was instrumental in developing 
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streptomycin for clinical use. On page 672 
of this issue, Howe et al.’ from the research 
laboratories of the present Merck describe a 
new antibiotic lead, identified using a sophisti- 
cated refinement of the phenotypic-screening 
approach introduced by Waksman. 

Seven decades after Waksman’s research, 
the flood of antibiotics emerging from natu- 
ral sources has dwindled to a trickle, and, for 
various reasons", few companies remain active 
in antibiotic drug discovery. This is despite an 
urgent clinical need for new agents in the face 
of rising resistance to existing antibiotics. On 
this background, Howe and colleagues’ sur- 
prising discovery of a drug target in a bacte- 
rial non-coding RNA (ncRNA) provides a 
welcome bright spot. 


a Screen for pathway inhibitors 


‘@ ‘@ 
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) RNA 
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Figure 1 | Metabolic-pathway blockade identifies an antibiotic. a, Bacteria use a metabolic pathway 
to synthesize riboflavin — a molecule involved in many enzymatic reactions — when environmental 
riboflavin is not available. Howe et al.’ searched for inhibitors of this pathway by identifying compounds 
whose antibacterial action could be attenuated by riboflavin addition. b, The screen uncovered the 
synthetic molecule ribocil as a selective inhibitor of riboflavin biosynthesis. The authors report that 
ribocil binds to a ‘riboswitch’ regulatory domain in a non-coding region of the messenger RNA that 
encodes a synthase enzyme involved in riboflavin synthesis. The riboswitch is normally bound by flavin 
mononucleotide (FMN), a metabolite produced from riboflavin; such binding induces a structural 
change in the riboswitch that prevents expression of the synthase enzyme and thus stops further 
riboflavin synthesis when sufficient FMN is available. c, Ribocil, despite having a very different structure 
from FMN, also binds at this site, tricking the switch into shutting off riboflavin production and depriving 


the bacterium of this essential metabolite. 


Phenotypic screening involves applying test 
compounds to bacterial cultures and observ- 
ing changes to the cells’ characteristics (their 
phenotype). Since Waksman’s days, the sim- 
plest phenotypic screen for antibacterial activ- 
ity has been to look for growth inhibition — cell 
death is the most severe of phenotypes and is 
readily measured. Howe and colleagues estab- 
lished a more subtle screen that interrogates 
a single bacterial metabolic pathway. Unlike 
humans, bacteria can synthesize the metabo- 
lite riboflavin, also called vitamin B2, which is 
a precursor of the cofactors required in many 
enzymatic reactions. The most prominent 
of these cofactors is flavin mononucleotide 
(FMN). Genes involved in riboflavin biosyn- 
thesis are essential only when the bacteria can- 
not acquire the vitamin from its environment. 
The researchers hypothesized that antibacte- 
rial compounds whose growth-suppressing 
effect could be reversed by supplementing the 
bacterial cultures with riboflavin would be 
candidate inhibitors of the riboflavin biosyn- 
thesis pathway (Fig. 1a). They tested around 
57,000 synthetic small molecules and identified 
the molecule ribocil as one that kills bacteria 
by selectively blocking riboflavin biosynthesis. 

To confirm ribocil’s mechanism of action, 
Howe and colleagues demonstrated that bac- 
teria treated with the compound were indeed 
depleted in riboflavin. In mice infected with 
pathogenic bacteria, ribocil treatment reduced 
the concentration of bacteria by more than 
1,000-fold, which is a promising start. The 
team went on to identify the molecular tar- 
get of ribocil by isolating bacterial clones that 


became resistant during prolonged exposure 
to ribocil at sublethal concentrations, and 
sequencing their genomes. What initially 
seemed a routine exercise turned to excite- 
ment when the resistant bacteria were found 
to harbour mutations in a non-coding stretch 
of the bacterial genome, suggesting that ribo- 
cil blocks a mechanism of gene regulation 
rather than inhibiting a protein target. Fur- 
ther sleuthing for ribocil’s site of attack led 
the researchers to a highly structured ncRNA 
domain upstream of the sequence that marks 
the translational start site ina messenger RNA 
encoding a key synthase enzyme in the ribo- 
flavin biosynthesis pathway. 

It turns out that the ncRNA domain bound 
by ribocil is normally bound by FMN, and the 
site is a ‘riboswitch, the term used for RNA 
regions that change structure when bound by 
a ligand (Fig. 1b). Binding of FMN traps the 
riboswitch RNA in a configuration that pre- 
vents access of transcription and translation 
machinery and thus blocks expression of the 
synthase enzyme. This mechanism provides a 
way for bacteria to reduce further production 
of riboflavin, and thus FMN, when sufficient 
amounts of the cofactor are available. By bind- 
ing to the same ncRNA target, ribocil tricks the 
riboswitch to respond, shutting off riboflavin 
production and depriving the bacteria of the 
essential metabolite. As a compelling piece of 
evidence, the researchers used X-ray crystal- 
lography to provide a snapshot of ribocil in 
the act of binding to the riboswitch RNA. Such 
structural information will be valuable for 
improving the antibacterial activity of ribocil 
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derivatives for potential clinical use. 

This mechanism of mimicking a natural 
ligand of a riboswitch is not known from any 
other antibacterial compounds. The study thus 
conclusively demonstrates the advantages of 
phenotypic screening that allows all constitu- 
ent components of a pathway — proteins and 
nucleic acids — to be tested simultaneously in 
an unbiased fashion. Although riboswitches 
are obvious targets for antibacterial discovery’, 
Howe and colleagues have for the first time 
identified a riboswitch-binding molecule that 
is nota close structural analogue of a metabo- 
lite ligand. This improves the odds that ribocil 
will have no off-target effects on other path- 
ways that involve riboflavin and FMN, which 
is already hinted at by the authors’ observation 
that even high doses of the compound were not 
toxic in mice. 

From a broader perspective, Howe and 
colleagues’ research is a striking demonstra- 
tion that structured regions in ncRNA may 
serve as targets for synthetic drugs. There 
are only a few previous instances of ncRNA 
elements being exploited as potential drug 
targets, one being a region in the hepati- 
tis C virus that is bound by synthetic inhibitors 
of viral protein synthesis’. But many natural 
antibiotics® work by interfering with protein 
synthesis through targeting ncRNA compo- 
nents in the bacterial ribosome, the cellular 
machine that synthesizes all proteins. 

Assuming that nature is on to something by 
targeting ncRNA in the ribosome, an optimist 
might point out that the plethora of ncRNAs 
recently discovered in the genomes of bacteria 
and more-complex organisms will provide a 
rich expansion of the range of targets for thera- 
peutic intervention”"”. For the sceptics, Howe 
and colleagues’ work might allay doubts that 
targeting ncRNA elements may deliver new 
drugs. And we can hope that clinicians who 
urgently need alternative antibiotics will be 
among the first to reap the benefits of RNA 
emerging asa drug target. = 
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Projections from neocortex mediate 
top-down control of memory retrieval 


Priyamvada Rajasethupathy""™*, Sethuraman Sankaran?*, James H. Marshel', Christina K. Kim’?, Emily Ferenczi>?, 


Soo Yeun Lee! 


, Andre Berndt’*, Charu Ramakrishnan!, Anna Jaffe!, Maisie Lo', Conor Liston>* & Karl Deisseroth!**5 


Top-down prefrontal cortex inputs to the hippocampus have been hypothesized to be important in memory 
consolidation, retrieval, and the pathophysiology of major psychiatric diseases; however, no such direct projections 
have been identified and functionally described. Here we report the discovery of a monosynaptic prefrontal cortex 
(predominantly anterior cingulate) to hippocampus (CA3 to CA1 region) projection in mice, and find that optogenetic 
manipulation of this projection (here termed AC-CA) is capable of eliciting contextual memory retrieval. To explore the 
network mechanisms of this process, we developed and applied tools to observe cellular-resolution neural activity in the 
hippocampus while stimulating AC-CA projections during memory retrieval in mice behaving in virtual-reality 
environments. Using this approach, we found that learning drives the emergence of a sparse class of neurons in CA2/ 
CA3 that are highly correlated with the local network and that lead synchronous population activity events; these 
neurons are then preferentially recruited by the AC-CA projection during memory retrieval. These findings reveal a 
sparsely implemented memory retrieval mechanism in the hippocampus that operates via direct top-down prefrontal 


input, with implications for the patterning and storage of salient memory representations. 


Pioneering studies (recently reviewed") have illuminated the molecu- 
lar and physiological mechanisms of information storage at synapses, 
but how populations of individual neurons form network representa- 
tions of memory is largely unknown. Recent studies have reported 
that only a fraction of eligible neurons are allocated during learning to 
form a memory~“, that the population initially encoding the memory 
is later preferentially recruited during retrieval of that memory’, and 
that subsequent activation of the initial ensemble alone can be suf- 
ficient for memory retrieval**. Intriguing questions remain regarding 
(for instance) whether neurons in the memory ensemble have differ- 
ent roles in storage and retrieval, or are controlled by top-down 
influences’”’ distinct from the well-explored bottom-up entorhinal/ 
hippocampal system; such top-down prefrontal projections to the 
hippocampus (if functionally present) might also support bidirec- 
tional communication during memory consolidation’, and would 
potentially be relevant to psychiatric disorders such as post-traumatic 
stress disorder’’, schizophrenia'* and drug addiction’’. It is also 
unclear whether behaviourally salient memories are laid down 
broadly across the brain'®”’, or are wired topographically within the 
local network for improved access'*”'. To address these and other 
questions regarding real-time population-level mechanisms of mem- 
ory storage and retrieval, we developed an approach to visualize and 
perturb top-down modulation of rapidly evolving memory ensembles 
in behaving mammals. 


AC-CA: a direct top-down projection 


To identify direct top-down inputs to the hippocampus, we injected a 
retrograde tracer capable of labelling afferent neurons with tdTomato 
(RV-tdT”) into the hippocampus. We observed robust tdT labelling 
in brain regions with known inputs to the hippocampus, including the 
medial septum, contralateral CA3 and entorhinal cortex (Fig. la and 
Extended Data Fig. la). Additionally, we identified a previously 


uncharacterized input arising from the dorsal anterior cingulate cor- 
tex (AC) and adjacent frontal cortical association cortex, both of 
which are reciprocally connected with the mediodorsal thalamic 
nucleus—a defining feature of the prefrontal cortex (PFC) in rodents 
(Fig. 1a; also confirmed with another retrograde tracer, canine ade- 
novirus (CAV)”’; Extended Data Fig. 1b). Injection of RV-tdT in the 
AC also sparsely labelled neurons bilaterally in the dorsal hippocam- 
pus, consistent with potential bidirectional communication between 
the AC and hippocampus (Extended Data Fig. 1c). To validate further 
the existence of this novel PFC-to-hippocampus projection, we 
injected an anterograde label (adeno-associated virus 5-enhanced 
yellow fluorescent protein (AAV5-eYFP)) into the dorsal AC 
(Fig. 1b) and detected fluorescence-filled projection terminals bilat- 
erally in the striatum and ipsilaterally in the medial dorsal thalamic 
nucleus (both areas are known to receive projections from the PFC), 
but also bilaterally in the hippocampus. 

To determine if these prefrontal projections gave rise to direct 
monosynaptic drive of hippocampal neurons, we transduced the 
AC with an AAV encoding a channelrhodopsin (ChR), and per- 
formed patch-clamp recordings of light-driven excitatory post- 
synaptic currents (EPSCs) in CA1/CA3 cell bodies (Fig. 1c). Cells in 
both CA1 (Fig. 1d) and CA3 (Fig. 1f) reliably responded to light pulse 
trains, and generated evoked EPSC amplitudes sufficient to drive 
action potentials (Fig. 1h, i). Responses when present were fast, with 
mean latency of 3.2 ms in CA1 (n = 26; Fig. le) and 2.7 ms in CA3 
(n= 13; Fig. 1g); along with the observation of sustained evoked 
spikes after 10 Hz stimulation (Fig. 1h, i), this finding was consistent 
with the presence of a direct and efficacious monosynaptic connection 
from the AC onto hippocampal pyramidal cells in the CA3/CA1 
subfields, which we accordingly term the AC-CA projection. No 
responses were observed in dentate neurons (Fig. 1)). 
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Figure 1 | Characterization of AC-CA monosynaptic projection. a, Five 
days after RV-tdT injection into the hippocampus (coordinates specified), 
retrogradely labelled neurons were detected in the AC (arrowhead). The 
injection did not leak below the hippocampus into the medial dorsal thalamus 
(asterisk), a known projection target of the PFC. Scale bars: X5: 300 um; X 10: 
100 um (confocal). A, anterior; D, dorsal; DAPI, 4',6-diamidino-2- 
phenylindole; M, medial; L, lateral; V, ventral. b, Five weeks after AAV5- 
CaMKIlIu::eYFP injection into the AC (coordinates specified), projection fibres 
were visualized in the bilateral striatum (arrows), the bilateral hippocampus 
at the stratum oriens and stratum radiatum of CA1 (one asterisk) and CA3 (two 
asterisks), and the ipsilateral medial dorsal thalamus (arrowhead), while 


AC-CA: causal role in memory retrieval 


To probe the functional importance of this projection, we conducted a 
series of optogenetic experiments to manipulate this pathway before 
and after contextual fear conditioning, and also in the setting of 
memory extinction and reinstatement. We first injected RV-ChR2- 
eYFP (or RV-eYFP) into the dorsal hippocampus, and targeted light 
delivery to retrogradely labelled cell bodies in the AC (Extended 
Data Fig. 2a). On day 1, ChR2 and eYFP mice underwent contextual 
fear conditioning in one context, while a ChR2-expressing control 
group was exposed to the same context without shock. On day 2, all 
mice were placed in a different context, in which the ChR2-expressing 
fear-conditioned group (m= 8) showed significant fear behaviour 
(freezing) only during light stimulation, compared with uncon- 
ditioned ChR2 (no shock, n = 6) or shocked control groups (eYFP, 
n=6, P<0.001, two-way analysis of variance (ANOVA) with 
repeated measures; Extended Data Fig. 2b). The time to freezing 
and time to unfreezing with light on/off switching were largely con- 
sistent across animals. On day 3, mice were placed back in the original 
context, verifying in both ChR2 and eYFP fear-conditioned cohorts 
strong memory encoding and retrieval, with significantly greater levels 
of freezing compared with no-shock controls (Extended Data Fig. 2c; 
P<0.001, unpaired t-test). 

This observation that cells contributing to the AC-CA projection 
can activate contextually conditioned fear behaviour was replicated 
and extended using a complementary anterograde projection- 
targeting strategy. We injected AAVdj-ChR2-eYFP (or AAVdj-eYFP 
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sparing CA2 and the dentate (brackets). The injection did not leak into the 
medial septum, a known input to the hippocampus (caret). Scale bars: <5: 
300 jm (100 Lm maximum projections). L., left; R., right. c, AAVdj- 
CaMKIIo.::ChR2(H134R)-eYFP was injected into the dorsal AC; post-synaptic 
responses were recorded from CA1 pyramidals in acute slice. Scale bars: X 20: 
60 um. d, CAI response amplitudes (n = 26 neurons, n = 6 mice); inset: raw 
traces. e, CA1 response latency (mean 3.2 ms). f, CA3 response amplitudes 

(n = 13 neurons, n = 2 mice); inset: raw traces. g, CA3 response latency (mean 
2.7 ms). h, i, CA1 (h) and CA3 (i) current-clamp traces illustrating spiking 
reliability at 10 Hz. Scale: 10 mV, 250/500 ms. j, No responses were detected in 
dentate neurons (n = 5). 


in a parallel cohort) into the AC, and targeted light stimulation to 
terminals in the hippocampus (Fig. 2a). We again observed significant 
freezing to optical stimulation in the neutral context only in the ChR2 
group (Supplementary Video 1) compared with the no-shock and 
eYFP controls (Fig. 2b; n =8 for all groups; P< 0.001, two-way 
ANOVA with repeated measures). These animals exhibited the same 
characteristic latency to freezing during light stimulation as in the 
previous experiment. We next tested whether this consistent beha- 
vioural response was indeed due to reactivation of a fear memory, 
rather than due to direct nonspecific drive of fear behaviour. The same 
mice corresponding to those shown in Fig. 2a, b were subjected to 
several days of contingency degradation by exposure to context A 
without shock (Methods), after which stimulation with light failed 
to produce significant freezing in ChR2 animals, as with no-shock and 
eYFP controls (Fig. 2c and Supplementary Video 2). Fear condition- 
ing was then reinstated in these mice in a new context, after which 
light stimulation once again reliably produced freezing in ChR2 mice 
compared with no-shock and eYFP controls (Fig. 2d; n = 8 for all 
groups; P<0.001, two-way ANOVA with repeated measures; 
Supplementary Video 3). Preservation of contextual fear memory 
on day 3 and successful fear memory extinction on day 14 were con- 
firmed (Fig. 2e). While these data demonstrated that cells contrib- 
uting to the AC-CA projection can drive fear memory recall, 
it remained possible that any drive of the hippocampus could 
induce retrieval of a recent strongly represented memory in the 
hippocampus. However, we found no evidence for this possibility 
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Figure 2 | AC-CA projections control top-down memory retrieval. 

a, AAVdj-CaMKIIo::ChR2-eYFP (or eYFP alone) was injected into the AC, 
and light was targeted to the dorsal hippocampus 5 weeks after injection. 
Timelines are indicated. FC, fear context; NS, no shock. b-d, Percentage time 
freezing on days 2, 15 and 17: 5s time bins (n = 8; P< 0.001, two-way 
ANOVA with repeated measures) are shown. Throughout this figure, error 
bars represent standard deviation (s.d.), to facilitate comparison with 
previously published literature; however, since freezing behaviour is not 
unbounded, these error bars may overestimate the upper limit of error. 
Summary bar graphs below are mean = s.d. 20s before light (dark bar) versus 
20 s after light (lighter bar). e, Preservation of contextual memory (day 3) and 
contextual extinction (day 14), original context A (mean + s.d., n = 8, 
P<0.001, paired t-test). f, AA Vdj-CaMKIIo::ChR2-eYFP (or eYFP alone) was 
injected into the medial septum, and light stimulation was targeted to the 
dorsal hippocampus 5 weeks after injection. g, Percentage time freezing during 


with two additional control experiments, either directly driving a 
different (septo-hippocampal) projection (Fig. 2f, g) or directly driv- 
ing the hippocampus itself (Fig. 2h, i). Preservation of normal con- 
textual fear memory in these mice was confirmed as before (Extended 
Data Fig. 2d, e; n = 8, P< 0.001, paired t-test). 

These experiments demonstrated that activating AC-CA project- 
ing cells was sufficient to induce contextual memory retrieval; we next 
tested necessity by targeting the inhibitory opsin eNpHR3.0 to cells 
giving rise to the projection (Extended Data Fig. 2f, g), with light 
targeted focally and bilaterally to AC-CA cell bodies (Fig. 2j). We 
observed striking deficits in both the latency (Extended Data Fig. 2h) 
and the strength (Fig. 2k; n = 12 for eNpHR3.0 group, n = 8 for eYFP 
group, P< 0.001, two-way ANOVA with repeated measures) of the 
fear response in the eNpHR3.0 group (Supplementary Video 4) com- 
pared with eYFP controls (Supplementary Video 5). This effect was 
fully reversible (Fig. 2k). We also found that eNpHR3.0 mice demon- 
strated intact auditory cued memory recall (Fig. 21; n= 8 for all 
groups, not significant with paired t-tests), confirming that the loss- 
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day 2: 10s time bins. Right, summary bar graph (not significant, unpaired 
t-test). h, AAVdj-CaMKIIo::ChR2-eYFP (or eYFP alone) injection and 

light targeting to the dorsal hippocampus 5 weeks after injection. i, Percentage 
time freezing during day 2, 5s time bins. Right, summary bar graph (n = 8; 
not significant, paired t-test). j, CAV-Cre was injected into the dorsal 
hippocampus, and a Cre-sensitive double-floxed inverted orientation 
(DIO)-eNpHR3.0 (or DIO-eYFP) was injected into the AC, and light was 
delivered above the cell bodies in the AC 8 weeks after injection. k, Percentage 
time freezing in context A, day 2 before light (dark bar) versus after light 
(lighter bar). This effect is reversible on day 3 (to the right of the dotted line). 
eNpHR3.0: n = 12; eYFP: n = 8; P< 0.001, two-way ANOVA with repeated 
measures. 1, Cued conditioning on day 1 followed by retrieval to the tone 

on day 2 during light on/off sessions. NS, not significant. All data are presented 
as mean + s.d. *P< 0.05, **P < 0.01, ***P< 0.001. 


of-function experiments described earlier represented a hippocam- 
pus-specific effect of the AC-CA projecting cells. 

Taken together, these anatomical, electrophysiological and beha- 
vioural data reveal the existence of a previously uncharacterized 
monosynaptic PFC-to-hippocampus projection. When this circuit 
is inhibited, fear-conditioned mice are unable to retrieve the fear 
memory with the same strength or speed as control counterparts, 
indicating endogenous importance for memory retrieval. In contrast, 
activation of this circuit is sufficient for robust fear memory retrieval 
in recently conditioned mice, but not in naive unconditioned mice, 
mice in which the memory has been extinguished, mice not expres- 
sing opsin, or mice receiving other types of direct or indirect drive of 
hippocampus. 


Highly correlated neurons emerge during learning 


To observe real-time influences of the AC-CA projection on hippo- 
campal network activity, we adapted the fear-conditioning paradigm 
to head-fixed mice navigating in a virtual environment on an axially 


29 OCTOBER 2015 | VOL 526 | NATURE | 655 


©2015 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


b Day3 Day 30 c 
a = pe Exposure retrieval retrieval 
= g e Training *** a 
Air puff (a) 
ww Lick port 


Lick rate (Hz) 


30 


‘Boa bf 
hill 


(ML MU} LAL 
| 
d Day 1: exposure to contexts Day 2: training 
| 160 i _ a 
120 
Fear 
context 80 
2 40 
2 - 4 
3 % 20 40 60 80 
xe} 
5 160: 
Q 
— 120. 
Neutral 2 


context 80 


Number of correlated pairs 


1k g 
-— a4 2 15 
oO 
ee1 
S25 
5 oD 
a 400 i) 0 
fe} 
2 300 2 
o 
© 200 
-333 0 +333 
101 Time (ms) 
Y) Fear context Pa t=? 
10 Neutral context A 


Percentage of 
cells active 
a 


— t= 200 


5 
PCA1 


150 200 250 
Time (s) 


0 50 100 


10 ye%5 10 © 


Figure 3 | Memory formation generates highly correlated HC neurons that 
represent context. a, Head-fixed virtual reality setup. Fear was quantified by 
lick suppression (Methods). P1, projector 1; P2, projector 2. b, Lick rates in 
fear (black) versus neutral (grey) contexts (mean + s.d.; n = 12; P< 0.001, 
paired t-test). Sample day 3 raw lick profiles in the fear (top) versus neutral 
(bottom) context. Scale: 1 V, 20s. c, Histology performed 4 weeks after 
injection/surgical implantation showing implant above GCaMP6m-expressing 
neurons. Left: X10; scale bar, 200 um. Right: X20; scale bar, 50 um. d, Histo- 
grams showing number of correlated partners for each neuron in the fear 
versus neutral contexts for a representative mouse (see Extended Data Fig. 5). 
e, f, HC neurons in the fear context (red) (e) have few correlated partners in 
the neutral context (f). n = 4 mice (60 + 18.2 (s.d.) in the fear context versus 
18 + 15.8 in the neutral context; P< 0.01, paired t-test; see Extended Data 
Fig. 6). g, Raster plot (top) and collapsed activity histogram (bottom) during 


fixed track ball under a two-photon microscope™*. We used lick 
suppression, rather than immobility, as a measure of fear behaviour 
in water-restricted mice’ (Fig. 3a and Methods); mice learned this 
task and displayed significant lick suppression during retrieval in the 
fear context, indicating successful memory retrieval (Fig. 3b; n = 12, 
P<0.01, paired t-test). For imaging during behaviour, mice were 
injected with the genetically encoded Ca** indicator GCaMP6m”*, 
implanted with a cranial window above CA2/CA3 (Fig. 3c; confirma- 
tion of normal hippocampal physiology and behaviour in these mice 
is shown in Extended Data Fig. 3), and imaged daily in both contexts 
during training and retrieval (Extended Data Fig. 4a). We targeted 
imaging preferentially to CA3 neurons by both stereotactic position- 
ing of the imaging field of view (FOV) (Fig. 3c), and by verifying that 
visualized neurons displayed dendritic processes traveling in the 
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memory retrieval in one mouse; representative HC neuron time series is 
overlaid (red). h, HC neuron activity onset (time 0) compared with onset 
activity of their correlated pairs (n = 67 HC neurons, 60.3 + 6% leading versus 
23.2 + 10% lagging; P< 0.01, unpaired t-test). i-k Synchronous activity 
(Methods) was quantified (n = 5, 8.1 + 4 events in the fear versus 4.2 + 1.6 in 
the neutral context; P< 0.01, paired t-test) (i), and was not accompanied by 
significant changes in velocity (j) or lick rate (k) (n = 5 mice; not significant 
(NS), paired t-test). FC, fear context; NC, neutral context. 1, Principal 
component analysis (PCA) from a representative mouse (Extended Data Fig. 9 
for additional data sets). Population trajectory in the fear (red) versus the 
neutral context (blue) was projected onto the respective first three principal 
components. Right: AF/F traces of HC neurons and their correlated neurons 
participating in each deflection. F, fluorescence; Scale: 400% AF/F, 20s. All 
error bars represent s.d. *P < 0.05, **P < 0.01, ***P < 0.001. 


focal plane of this dorsal view (Supplementary Video 6; in contrast, 
CA2 or CAI neurons show processes traveling out of the focal plane 
due to the curvature of the pyramidal layer); however, some CA2 
neurons are likely to be included. In all cases, fast volumetric 
(500 X 500 ttm x/y, 100 jm z) two-photon imaging was performed, 
providing access to>400 neurons (Extended Data Fig. 4b). 

We first sought to identify features of the functional network that 
differed consistently across days or contexts. Many features were 
indistinguishable, including mean response magnitude of active neu- 
rons, mean activity event rate, mean activity event duration, and 
spatial distribution of active neurons (Extended Data Figs 4c-f 
and 5a). However, we noted a significant increase in the number of 
neurons active during memory retrieval in the fear context (Extended 
Data Fig. 5b; n =5 mice); interestingly, this was accompanied by 
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a significant decrease in mean correlated activity. Although prior 
studies reported increased correlated activity after learning””*', a par- 
simonious unifying explanation could be that reduced mean corre- 
lated network activity reflects a state of greater sparsity after learning, 
in which few neurons engage in considerably higher correlated activ- 
ity while most become de-correlated. Indeed, binning the number of 
correlated partners for each neuron revealed a significant distribution 
shift (Fig. 3d and Extended Data Fig. 5d), from randomly organized 
Poisson-like correlation distributions before learning to more 
ordered, power-law-like distributions after learning, with the emer- 
gence of a small population of neurons displaying highly correlated 
activity in the network (HC neurons). Importantly, these network 
properties were specific to the learned context, and indeed, at the 
single-neuron level, the highest levels of cell-by-cell correlation were 
seen in the fear context compared to the neutral context (Fig. 3d and 
Extended Data Fig. 5d; n =5; P<0.01, paired t-test). Additional 
quantitative properties were assessed for differential representation 
of the fear and neutral contexts (Extended Data Fig. 5d-i), revealing 
that the power-law exponent b provided the largest contribution to 
this context separation (Extended Data Fig. 5c; n = 5, P< 0.01), con- 
sistent with the emergence of HC neurons representing aspects of fear 
memory retrieval. Intriguingly, HC neurons in the fear context 
(Fig. 3e, shown in red) tended to be neurons that had a low degree 
of correlated partners in the neutral context (Fig. 3f and Extended 
Data Fig. 6), suggesting that the emergence of HC neurons after 
learning does not stem simply from strengthening of pre-existing 
correlated cell assemblies. 

To understand better the importance of these HC neurons, we next 
focused analysis on the activity of the entire network at times when the 
HC neurons were active. The HC neurons tended to lead rather than 
lag their correlated pairs (Fig. 3h), which were spatially distributed 
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throughout the volume (Extended Data Fig. 7a, b). Furthermore, 
although overall cell-by-cell correlated activity was reduced during 
the fear retrieval test, significantly more population-wide synchron- 
ous events (Fig. 3g, i), which were confirmed to be not related to 
motion (Fig. 3j, k) and consisted of essentially orthogonal groups of 
neurons (Fig. 31 and Extended Data Fig. 9), occurred in the fear 
context; HC neurons were found to lead these broad synchronous 
events (Extended Data Fig. 7c; P< 0.001, Kolmogorov-Smirnov 
two-tail test), with 78% of HC neurons active within the first 20% 
of a synchronous event. This event-leading nature could be consistent 
with a role for HC neurons in recruiting network activity. 

Importantly, these analyses were designed to limit the effects of 
potential confounds of slow fluctuations in the signal (for example, 
GCaMP6m and Ca?" kinetics) on correlations between neurons (see 
Methods). Additionally, a fast non-negative deconvolution analysis” 
(detailed fully in Methods), for detecting onset of activity while 
removing slow decay kinetics, yielded consistent results as described 
earlier for the increased pair-wise correlations in the fear versus neut- 
ral context, the event-leading nature of HC neurons compared with 
their correlated pairs, and the increased synchronous events observed 
in the fear versus neutral context (Extended Data Fig. 8). 


AC-CA projections target HC neurons during retrieval 

These volumetric imaging studies during memory retrieval dem- 
onstrate the emergence of a sparse set of HC neurons characterized 
by high correlations and leading of local synchronous events. 
Such neurons could serve as efficient points of access if preferentially 
recruited by top-down projections during memory retrieval. To 
test this idea, we sought to stimulate AC-CA projections while 
simultaneously imaging the postsynaptic hippocampal network to 
observe local dynamics directly. Current in vivo-tested red-shifted 
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Figure 4 | The AC-CA projection preferentially recruits HC neurons 
during memory retrieval. a, Left, AAV8-CaMKII::bReaChES-eYFP or AAV8- 
CaMKII::C1V17;7-eYFP (C1V 11,7 denotes the red-light-activated channel- 
rhodopsin engineered and characterized in refs 33 and 47) was injected into the 
AC, and fibre terminals were visualized in CA3 (red). Scale bars, 60 1m (X20). 
Right, AAV8-CaMKII::tdT in the AC, and AAVdj-CaMKII::GCAMP6m in 
CA2/CA3. Histology 8 weeks after cannula implantation showing preservation 
of AC-CA projections (red) near GCaMP6m-expressing neurons (green). Scale 
bar, 50 um (X20). b, Two-photon imaging and full-field optogenetic stimulation 
setup (Methods). Green indicates GCaMPé6m signals; red, stimulation 
artefact. D1, 680 nm short-pass dichroic; D2, 594 band/NIR long-pass dichroic; 
D3, 555 nm long-pass dichroic; FC, fibre couple; F1, 520 + 22 nm filter; F2, 
624 + 20 nm filter; L, 25 mm planoconvex lens; MMF, multimode fibre; NIR BF, 
near infrared blocking filter; PMT1 and PMT2, GaAsP PMTs; TL, tube lens. 
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c, Z-projection images (mean over time) at two depths (40 tm apart) from a 
representative mouse before and after training. Scale bars, 60 jim. d, Proportion 
of cells responding (CDF, cumulative distribution function) to AC-CA 
stimulation as a function of latency (n = 4 mice; 12 trials, P = 0.002, 
Kolmogorov-Smirnov two-tail test, « = 0.2673). Sample traces in inset (red box: 
stimulation duration). Consecutive trials are overlaid. e, Optical stimulation in 
the neutral context induces significant lick suppression after training (n = 4 
mice, 12 total trials, mean + s.d., P< 0.01, paired t-test). No lick suppression 
was seen before training (not significant, paired t-test). FC, fear context; NC, 
neutral context. f, Fraction of fear- and neutral-context HC and non-HC 
neurons recruited during before- (n = 3 mice, 10 total trial) and after-training 
optical stimulation (n = 4 mice, 12 total trials, P< 0.001, one-way ANOVA); 
mean, quartile, minimum and maximum are shown. **P < 0.01, ***P < 0.001. 
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opsins, however, were not suitable because, as we and others have 
found, C1V1 (ref. 33) does not traffic well to the terminals of long- 
range projections, and ReaChR™ (despite adequate trafficking) shows 
lower photocurrents and reduced spike fidelity in mammals. By 
extensively modifying ReaChR to include ChETA-based* and other 
mutations, we generated a red-shifted opsin termed bReaChES with 
strong photocurrents, high spike fidelity (Extended Data Fig. 10) and 
robust trafficking in long-range projections (Fig. 4a). 

Mice were injected with GCaMP6m in CA3 and bReaChES in the 
AC, and implanted with a cannula above CA2/CA3 (Fig. 4a) for 
simultaneous one-photon stimulation of projection terminals and 
two-photon imaging of CA2/CA3 pyramidal neurons through the 
same window (Fig. 4b). To test the causal effect of the projection, 
we performed multiple optical-stimulation trials both before and after 
fear conditioning, while tracking the same neurons across contexts 
and days (Fig. 4c). While trial-to-trial variability existed in the num- 
ber and identity of neurons activated, fear conditioning was found 
consistently to increase the fraction of cells that were time locked to 
the onset of optogenetic stimulus of the top-down projection (Fig. 4d; 
P= 0.002, Kolmogorov-Smirnov two-tail test). Finally, we directly 
tested recruitment of the memory-associated HC neurons by this 
projection. To do this, we first established that, consistent with earlier 
results, head-fixed mice were able to learn the contextual fear con- 
ditioning task and that stimulation of the AC-CA projection induced 
fear memory retrieval only after training and not before (Fig. 4e). 
Quantification over many trials indicated that stimulation of the 
AC-CA projection recruited relatively few (~5%) HC neurons in 
either the fear or neutral context before training, whereas there was 
a marked increase (~20%) in the fraction of HC neurons recruited 
after training; recruitment of non-HC neurons in the fear context, and 
recruitment of any neurons in the neutral context, remained 
unchanged and low (Fig. 4f). These results further demonstrated swift 
reorganization of the functional impact of the AC-CA projection, 
with preferential recruitment of HC neurons associated with the 
recently formed contextual fear memory. Together, these findings 
reveal a means by which top-down circuit influences could organize 
and engage with salient memory representations to enable efficient 
retrieval. 


Discussion 


We have identified a direct projection from the AC to the hippocam- 
pus with properties that mediate retrieval of recently encoded mem- 
ory traces. The ability of the AC to select appropriate targets among 
hippocampal pyramidal neurons during context retrieval points to 
potential reciprocity between these two regions; the hippocampus is 
well positioned to inform the AC regarding context through bottom- 
up pathways, after which the AC can access and mobilize engrams in 
the hippocampus for top-down control of retrieval. The AC may also 
form an independent representation during training, as suggested by 
the speed (within 1 day post-training) with which the AC-CA pro- 
jection can mediate top-down retrieval. This rapid retrieval also 
demonstrates that the AC is engaged early during the memory encod- 
ing process, as has been suggested”""', which, together with the pre- 
vious finding of hippocampal engagement even at later time points”, 
suggests a shift towards viewing the AC and hippocampus, and their 
bidirectional communication, as having shared involvement in both 
early and late stages of memory. The speed and specificity of the top- 
down retrieval also underscores likely plasticity at the AC-CA term- 
inals for dynamic access to recently allocated neurons in the hip- 
pocampus. 

We have found that this top-down control over contextual memory 
involves preferential recruitment of a sparse set of hippocampal neu- 
rons that are highly correlated in the local network and that tend to 
lead population-wide synchronous events, but only in the memory 
context and in a manner only seen after training. Sparsification of 
memory-associated networks during learning has been observed***”° 
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and computationally predicted”**’”, since hierarchical networks with 


few extensively connected hub-like nodes, as observed here, are well 
suited for memory stability (random degradation of the network will 
probably affect non-hub nodes that are less consequential to the 
memory representation). But beyond memory stability, the emer- 
gence of contextual memory-specific HC neurons could facilitate 
efficient access to engrams, in a process suitable for memories that 
are strongly encoded via repetition, reward, or emotional saliency. 
This circuit property could be broadly relevant, helping to explain 
how even single cortical neurons can in some cases drive network 
activity and behaviour***. 

In the future, improvements in two-photon stimulation of indi- 


vidually targeted neurons over large three-dimensional volumes, 


together with simultaneous recording of network activity”, may 


enable further delineation of the causal role of HC neurons during 
memory retrieval. Also of importance is the exploration of the 
molecular and structural properties of the observed HC neurons, 
and their inputs and outputs, through high-content methods such 
as single-cell RNA-sequencing and CLARITY (a method for creating 
labelled hybrids or composites of biomolecules in tissue covalently 
linked to acrylamide-related polymer hydrogels, which allows 
removal of unlinked tissue elements to create transparency and 
high-resolution accessibility to the macromolecular labels)*°. More 
broadly, further study of the AC-CA and other top-down projections 
with the circuit tracing and control methods described here may help 
to elucidate brain-wide regulation of memory in normal behaviour 
and in maladaptive states involving aberrant communication between 
PFC and hippocampus. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Animals. Wild-type C57B16/J male mice were group housed three to five to a cage 
and kept ona reverse 12 h light/dark cycle with ad libitum food and water (except 
in virtual reality behaviour experiments, for which water was restricted; details 
later). Experimental protocols were approved by Stanford University 
Institutional Animal Care and Use Committee (IACUC) and meet the guidelines 
of the National Institutes of Health guide for the Care and Use of Laboratory 
Animals. The target number of subjects used in each experiment was determined 
based on numbers reported in published studies. No statistical methods were used 
to predetermine sample size. 

Anatomical tracing and histology. Viral injections were carried out under pro- 
tocols approved by Stanford University IACUC and were performed in mice 
anaesthetized with 1-2% isoflurane using a stereotaxic apparatus (Kopf 
Instruments). For retrograde tracing, 4-5-week-old wild-type male mice were 
injected slowly (50 nl min‘) with small amounts (200 nl) of highly concentrated 
glycoprotein-deleted rabies virus tagged with tdTomato (RV-tdTomato)” in the 
dorsal hippocampus (A/P: —1.5mm; M/L: +1.75 mm; D/V: —1.8mm) with a 
1ul Hamilton syringe and a 35-gauge bevelled needle (World Precision 
Instruments) under the control of a UMP3 syringe pump (WPI). Following 
injections, the incisions were closed using Vetbond tissue adhesive (Fischer), 
and mice were allowed to recover and were housed for 5 days to allow for 
expression before their brains were collected for histological analysis. In the 
case of anterograde tracing, 4-5-week-old wild-type male mice were injected 
(150 nl min~') with 500nl of AAV5-CaMKIlo::eYFP (titre: 2 10’? vg ml~') 
in dorsal anterior cingulate (A/P: +1; M/L: —0.35; D/V: +1.2) and were housed 
for 30 days to allow for expression in terminals before collection of brains for 
histological analysis. 

For histological analysis, injected mice were transcardially perfused with ice- 

cold 1X PBS, immediately followed by perfusion of 4% paraformaldehyde (PFA). 
Brains were fixed overnight in PFA, then transferred to a 30% sucrose/PBS 
solution. Coronal sections of either 40 jum (for retrograde tracing with RV) pre- 
pared using a freezing microtome (Leica) or 300 jum (for anterograde tracing with 
AAVS5) prepared using a vibratome (Leica) were collected and stored in a cryo- 
protectant solution (25% glycerol, 30% ethylene glycol, in PBS) until further 
processing. For DAPI staining, slices were washed in PBS, incubated for 20 min 
with DAPI at 1:50,000, washed again in PBS, then mounted with PVA-DABCO 
(Sigma). A scanning confocal microscope (TCS SP5, Leica) and LAS AF software 
(Leica) was used to obtain and analyse images. 
Acute slice electrophysiology of AC-CA synapses. Acute brain slices were pre- 
pared from mice 6-8 weeks following viral injection with AAV5-CaMKIIo::: 
ChR2(H134R)-eYFP, to allow sufficient time for channelrhodopsin to express 
in axon terminals. After lethal anaesthesia, mice were transcardially perfused with 
cold sucrose slicing solution (see later) before decapitation, following which the 
brain was rapidly extracted and submerged in ice-cold sucrose-based slicing 
solution (234mM sucrose, 26mM NaHCO;, 11 mM glucose, 10 mM MgSO," 
7H20, 2.5 KCl, 1.25 mM NaHPO,'H20, 0.5 mM CaCl,*2H20). Coronal hippo- 
campal slices (300 lm thick) were cut on a Leica vibratome (Leica VT1000S) in 
sucrose solution and then submerged in a hypertonic recovery solution (artificial 
cerebrospinal fluid (ACSF) at an 8% increased osmolarity) at 33°C for 15 min 
before being transferred to standard ACSF (123mM NaCl, 26mM NaHCOs;, 
1lmM _ glucose, 3mMKCl, 2mMCaCl,*2H2O, 1.25mM NaH»PO,°H20, 
1mM MgCl,"6H,0) for a further 45 min at 33°C, at which point they were 
transferred to room temperature. 

Whole-cell patch-clamp recordings from CA3/CA1 hippocampal neurons 
were performed on an upright Leica DM-LFSA microscope. Borosilicate glass 
(Sutter Instruments) pipette resistances were pulled to 3-6 MQ and filled with 
potassium gluconate intracellular solution (130mM KGluconate, 10 mM KCI, 
10mM HEPES, 10 mMEGTA, 2mM MgCl, pH adjusted with KOH to 7.3). 
Voltage and current-clamp recordings were performed using pClamp (Axon 
Instruments). Cells with leak current greater than —200 pA or series resistance 
greater than 35 MQ were excluded. Light stimulation was performed using a 300 W 
DG-4 lamp (Sutter Instruments) with an external filter for blue light (wavelength in 
nm/bandwidth in nm: 470/20). Light pulses (2-5 ms pulse width) were delivered 
through a X40, 0.8 NA water-immersion objective at 4-10 mW mm * light power 
density. Latencies were measured as light pulse start to EPSC initiation. 
Optogenetics and behaviour. After injection with the indicated virus (for 
example, CAV or RV, expressing ChR2, eNpHR3.0, or eYFP) at the appropriate 
location (for example, cingulate, hippocampus, or medial septum), as described 
in Fig. 2 and Extended Data Figs 2 and 4, 5-week-old wild-type male mice were 
implanted with implantable fibre-optic lightguides (IFLs) consisting of a 2.5-mm- 
diameter metal ferrule with 0.22 NA and a 200-j1m-thick protruding cleaved bare 
optic fibre cut to the desired length (Thorlabs) as previously described”, either at 
the injection site (typically ~0.2 mm dorsal to the injection site) or at the term- 


inals for stimulation experiments as indicated in the figure legends. For inhibition 
experiments, dual fibre-optic cannulas of 200 jim thickness and 0.22 NA spaced 
0.7 mm apart were used to target anterior cingulate bilaterally, and two-ferrule 
cannulas spaced 3 mm apart were used to target hippocampus bilaterally. Mice 
were typically allowed to recover and housed for 1 month to allow for adequate 
expression before behavioural testing. All animals undergoing behavioural 
experiments were acclimated to a 12 h reverse light/dark cycle, handled for several 
days, and before behavioural testing, were acclimated to the room in which 
experiments were to be conducted for at least 30 min. 

The fear conditioning apparatus consisted of a square conditioning cage 
(18 X 18 X 30cm) with a grid floor wired to a shock generator and a scrambler, 
surrounded by an acoustic chamber (Coulburn Instruments). The apparatus was 
modified to enable light delivery during retrieval testing. Contextual fear con- 
ditioning was performed by placing mice in the conditioning cage (visual cues: 
bare walls; tactile cues: grid floor; odour cues: 70% ethanol) for 6 min, while 
receiving four 2s shock pulses of 7mA each at 1 min intervals, with the first 
shock presented 2min after placing the mouse in the conditioning context. 
A fraction of animals of the same cohort were not fear conditioned, and instead 
served as a control group that were just exposed to the conditioning context for 
the same amount of time (6 min) but did not receive any associated shocks. The 
following day, all mice were tested in a different ‘neutral’ cage (visual cues: 
coloured shapes; tactile cues: smooth paper towel covered plexiglass floor; odour 
cues: 1% acetic acid) for light-mediated fear retrieval. 

For stimulation experiments, optical stimulation through the fibre-optic con- 
nector was administered by delivering light through a patch-cord connected to a 
473 nm laser in 30s light-on/1 min light-off sessions. During light-on sessions, 
stimulation was delivered at 20 Hz, 15 ms pulses, with 8-10 mW power at the fibre 
tip. On the third day, all mice were then returned to the original conditioning 
context for 2.5 min to assess intact natural fear memory retrieval. In some cases, 
subsequent extinction of fear memory was performed by placing mice in the 
original conditioning chamber for three consecutive days, for 5 min each, without 
shock. Light-induced fear retrieval was then tested in the neutral context 24h 
following the last extinction training session. Subsequent reinstatement was per- 
formed by again placing the animals back in the conditioning context for one 6 min 
interval and providing four 2 s shock pulses of 7 mA each at 1 min intervals. A final 
light-induced fear retrieval testing was performed 24h later as described earlier. 

For loss of function experiments, optical inhibition through a fibre-optic con- 
nector was administered by delivering light through a dual patch-cord connected 
to a 589 nm laser. Constant light at 8-10 mW was used at the fibre tip to deliver 
inhibition either at cell bodies or terminals. On the first day, both eNpHR3.0 and 
eYFP control groups were trained to contextual fear conditioning as described 
earlier, and on the second day, mice were allowed to perform retrieval as usual 
during light off for the first 2 min, to assess baseline freezing in each animal. Then 
light was turned on for the next 30s (not longer, as the potential for extinction 
related unfreezing could confound light-related unfreezing at time points suc- 
ceeding the typical 2-3-min retrieval protocol). Freezing scores during the 30s 
light sessions were compared with the per cent freezing during 30 s of the imme- 
diately preceding light-off sessions. On the third day, all mice underwent retrieval 
in the conditioning context for 2 min with light off to test for reversal of light- 
induced behaviour. After context conditioning and retrieval, all mice subse- 
quently underwent auditory-cued conditioning (cued conditioning was done 
separately from context conditioning to ensure robust conditioning to both con- 
text and cue, since when performed together, mice often develop robust condi- 
tioning to tone (the more salient cue) and only weak conditioning to context). To 
perform auditory-cued fear conditioning, mice were placed in a different context 
(with coloured shapes as visual cues and a smooth floor), for 6 min, where after 
the first 2 min, four 20s auditory cues consisting of 2.9 kHz tone was played at 
1 min intervals, each followed by a 2s 7mA shock. Retrieval on the subsequent 
day was performed by presenting the tone four times (two during light offand two 
during light on) at 1 min intervals and per cent freezing was assessed during the 
20s post-tone compared with the immediately preceding 20s during tone, for 
both light-off and light-on conditions. Latency measures were performed as 
separate experiments, using the same cohorts; after finishing contextual and cued 
conditioning, these mice were retrained (contextually fear conditioned) to the 
first conditioning context. On the following day, 2 min retrieval was performed in 
the conditioning context with light on the entire time to test for latency to 
freezing, where latency was defined as the first instance in time that the animal 
was immobile for 5 consecutive seconds. Freezing in all experiments was scored 
by an experimenter blinded to the treatment group. Randomization of animals to 
experimental and control groups was performed by an experimenter with no 
explicit randomization algorithm used. All of the results were analysed by 
Student’s t-test or two-way ANOVA, followed by post-hoc tests, as applicable. 
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Hippocampal cranial window. C57BL/6] male mice were injected with 500 nl of 
AAVdj-CaMKIIo::GCaMP6m in CA3 (A/P: —1.7, M/L: +1.9, D/V: —1.9) and 
allowed to recover for at least 1 week before surgical implantation of a cranial 
window above CA2/CA3 for optical access similar to previously described hip- 
pocampal preps”. Briefly, mice were injected with 80 mgkg”'/6 mgkg’ ' of keta- 
mine/xylazine intraperitoneally, and maintained under 1.0-2.0% isoflurane 
throughout. For optimal window placement to access CA2/CA3, the mouse’s 
head was angled during surgery such that the skull location at the CA2/CA3 
injection site was level and exactly perpendicular to dorsal views of the head. A 
circular titanium headplate (7 mm in diameter) was centred over CA2/CA3 and 
adhered to the skull with adhesive cement (Metabond; Parkell) and a ~3 mm 
craniotomy was made in the centre using a trephine (Fisher). Parts of cortical 
region S1 and of parietal association cortex were vacuum-aspirated, with care 
taken to avoid the ventricle, until white matter was visible above the hippocam- 
pus. Vacuum aspiration was done with a 27-gauge blunt needle while irrigating 
with chilled 1X PBS. The top layer of white matter above the hippocampus was 
further removed by vacuum aspiration with a 31-gauge blunt needle, but care was 
taken to preserve deep layers of external capsule and the alveus (to preserve 
afferents and efferents to hippocampus). A forceps was used to manually insert 
a cylindrical borosilicate glass implant until the floor of the implant rested against 
the hippocampus. The implant was constructed from a 3.0-mm-diameter glass 
capillary tube (Friedrich & Dimmock) custom cut to 1.5 mm length, adhered on 
one end to a 3.0mm diameter coverslip of #0 thickness (Warner Instruments) 
using UV-curing optical glue (Norland Products). The top of the implant extrud- 
ing from the craniotomy was then secured to the skull using Metabond adhesive 
cement. After surgery, mice were given 5 mgkg ' carprofen subcutaneously and 
allowed to recover for at least 1 week before behaviour training. 

To ensure that the above manipulations (including GCaMP6m virus injection 

into CA3, GCaMP6m expression, and surgical excavation of certain regions of 
cortex) did not affect normal physiological properties of the hippocampus, we 
performed control experiments to assess Ca**-dependent physiology in weakly 
versus strongly expressing CA3 neurons in vitro, spontaneous activity in weakly 
versus strongly expressing CA3 neurons in vivo, and behavioural measurements 
before and after placement of the cannula (Extended Data Fig. 3). 
Virtual reality behaviour. We used a custom built virtual reality environment, 
modified from previously reported versions***'. A 200-mm-diameter styrofoam 
ball (Graham Sweet Studios) was axially fixed with a 6-mm-diameter assembly 
rod (Thorlabs) passing through the centre of the ball and resting on 90° post 
holders (Thorlabs) at each end, allowing free forward and backward rotation of 
the ball. Mice were head-fixed in place above the centre of the ball using a head- 
plate mount”. Virtual environments were designed in game development soft- 
ware Unity3d (http://www.unity3d.com). The virtual environment was displayed 
by back-projection onto projector screen fabric stretched over a clear acrylic 
hemisphere with a 14-inch diameter placed ~20cm in front of the centre of 
the mouse. The screen encompasses ~220° of the mouse’s field of view. The virtual 
environment was back-projected onto this screen using two laser-scanning projec- 
tors (Microvision), each projector covering one half of the screen. To create a flat 
image on the three-dimensional screen, we warped the two-dimensional image of 
the virtual environment using video manipulation software (Madmapper). The 
game engine allowed scripts written in JavaScript or C# to trigger external events 
based on the mouse’s interactions with the virtual environment by communicating 
over a TCP socket to custom Python control software. A LabJackU6 (http://lab- 
jack.com) was used to time-lock virtual environment events and imaging frame 
times, to record mouse licking behaviour with incoming TTL pulses from the 
lickometer (Island Motion), and to send TTL pulses to deliver solenoid-gated water 
rewards (delivered from a gravity-assisted syringe attached to tubing connected to 
the lickometer) and aversive air puffs (from a compressed air tank to a tube ending 
in a pipette tip facing the mouse’s snout). Tactile and odour cues were fixed directly 
to each of two Styrofoam balls representing the two separate contexts. Auditory 
stimuli were presented through speakers situated behind the animal. The mouse’s 
movements on the ball were recorded using an optical computer mouse (Logitech) 
that interfaced with the virtual environment software. 

For fear conditioning in the virtual environment, mice were water restricted 
(>80% pre-deprivation weight) and habituated to handling, head-fixation, and 
the virtual environment for at least 2 weeks, with free access to small water 
rewards (~0.5 ulper 10 licks) while on the ball. By the end of 2 weeks (one 
5-min session per day), mice appeared comfortable and alert on the ball. After 
habituation, mice underwent a 4-day fear conditioning training and testing pro- 
tocol. On day 1, mice were exposed to two contexts that differed in visual (blue 
triangles versus pink vertical stripes), tactile (smooth side of Velcro versus sharp 
side of Velcro fixed onto running ball), odorant (acetic acid versus ethanol), and 
auditory cues (8 kHz phasic tone versus 3 kHz pure tone) for 5 min each. On day 
2, mice were provided with 8 aversive air puffs to the snout (500 ms, 10 psi) at 
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randomly timed intervals throughout the 5 min while in the fear context, but not 
while in the neutral context for 5 min. On days 3 and 30, mice were placed back in 
each of the two contexts for 5 min for retrieval. 

Imaging. Five mice were imaged on all days, in 5-min sessions, during exposure, 
training, and retrieval. We used a resonant galvanometer two-photon microscope 
(Prairie Technologies). We used the genetically encoded calcium indicator 
GCaMP6m in all experiments (GCaMP6m was amplified from Addgene plasmid 
#40754 by PCR and subcloned into an AAV backbone under the control of the 
CaMKIla promoter.) All experiments were performed using a Coherent Ultra II 
Ti-Sapphire pulsed laser tuned to 920 nm to excite GCaMP6m through a X20 0.5 
LUMPlanFL/N (Olympus) water-immersion objective interfacing with the 
implanted cannula through a few drops of distilled water. Fluorescence was 
detected through gallium arsenide phosphide (GaAsP) photomultiplier tubes 
(PMTs) using the PrairieView acquisition software. High speed z stacks were 
collected in the green channel (using a 520/44 bandpass filter, Semrock) at 
512 X 512 pixels covering each x-y plane of 500 ttm X 500 jim over a depth of 
~100 um (3-7 z slices ~ 10-20 tm apart) by coupling the 30 Hz rapid resonant 
scanning (x-y) to a Z-piezo to achieve ~6 Hz per volume. 

Data analysis. Later, we describe the methods to extract cells (pre-processing), 
obtain cellular-level activity (AF/F) measures (processing), and evaluate popu- 
lation-level activity measures (post-processing). In statistical analysis of the post- 
processed data, both parametric and non-parametric tests were employed as 
appropriate. In cases where normality could not be assessed (low sample sizes), 
we ensured that there were no significant outliers (by Grubbs’ test) and that the 
variance between groups was not significantly different (by Levene’s Test). 
Pre-processing (cell extraction). Time series data sets were x-y motion cor- 
rected with Image] plug-in Stack Reg using rigid body transformations. Cell 
extraction was then performed sequentially, by first computing cell segments 
automatically followed by manual quality control for missed cells, non-cells, or 
conjoined cells. For initial automatic extraction, we used a metric based on 
image threshold intensity, variance and skewness. Images with high contrast- 
to-noise ratio, wherein clear thresholds in maximum intensity separated cells 
and background, were fully segmented with the former. In the remainder of 
cases, cells were distinguished from background based on standard deviation 
across time (high for active cells), or skewness (asymmetry) in intensity across 
time*’. This resulted in a general mathematical criterion to define cell-masks at 
each voxel location (i, j, k): 


MGij,k) = tr] Emax (i. j,k) > Fe) + Bel or ,j,k) > oc) + pL se G.j,k) > sc) 


where J is the indicator function (=1 if the condition is satisfied); op(i,j,k) is the 
standard deviation of intensity over time defined as 


o-°(injsk) =E|(Fikt) —F)| 


and skewness is defined as 


Eis the expectation operator; F., ¢. and s, represent cut-offs for image intensity, 
standard deviation and skewness respectively. Coefficients ap, bp and yp are 
chosen on an image-specific basis; if thresholding is sufficient Jy and yz are 
chosen to be zero, otherwise coefficients are iterated to obtain a cell mask 
containing the largest population of active cells (evaluated by inspection). 
Automatic cell extraction was then followed by manual cell-by-cell curation to 
identify cells that were not extracted using the automated algorithm. This occurs 
when cell boundaries may not be captured due to non-translational motion 
artefact in the original imaging, and/or lack of clear cut-offs F., o- and s, differ- 
entiating cell and background. For these cases, cell detection is performed with a 
manual editing step involving comparison of the automated cell-mask to the raw 
image data, and by using a Gaussian filter was applied on the edited image to 
smooth edges, and edge-detection™ was used to define cell boundaries. The 
interior of the resulting cells were filled, and the final cell masks were eroded to 
minimize contamination from neuropil signal. Each cell was labelled with a 
unique cell identifier for the next stage; custom-written MATLAB scripts were 
used for all steps, and are available on request. 
Processing. Calculation of AF/F. For each cell identified in step 1, the intensity 
value F was obtained by averaging over all pixels inside the ROI to compute a 
space-averaged value F for each frame (corresponding to a single time point). 
These are used to define AF/F in each cell as 


AF _ F —Fraseline 
F F baseline 
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where Fyaseline is the baseline fluorescence, calculated as the mean of the fluor- 
escence values for a given cell, continuously acquired over a 20s moving time 
window to account for slow time-scale changes in fluorescence. Given the sparse 
firing of neurons in our data set, the mean served as an accurate estimate of 
baseline activity (fluorescence). Furthermore, the main results of the study were 
not influenced by using the median or 8th percentile as the baseline (and correla- 
tions were independent of baseline definition). 

Statistical analysis of neuronal responses. We used an approach similar to that 
outlined previously™ to identify significant transients in each neuron, as well as to 
estimate and remove effects that may be related to motion artefacts. Briefly, to 
estimate the occurrence rate of potential motion-related fluorescence changes in 
the signal, all negative deflections in the AF/F trace were assumed to be due to 
motion. Because motion-related fluorescence changes should be equally likely to 
generate positive- or negative-going changes, positive and negative deflections in 
the AF/F curve that are attributable to motion should occur at the same frequency 
and can be subtracted out of the signal by using the rate of occurrence of the 
negative-going transients as an estimate of the rate of motion-related positive- 
going transients. 

To determine statistically significant transients, we first calculated an estimate 
of the noise for each cell using an iterative approach: (1) initialize a cut-off value 
that separates signal and noise, (2) calculate the standard deviation (¢) of all AF/F 
values that fall below the cut-off, and (3) compare 3c to the cut-off. In this 
analysis, the goal is to find an estimate of standard deviation (c) of the noise, 
defined for time periods that are unlikely to contain neural events (that is, using 
the iterative approach to estimate the o of the noise, rather than calculate standard 
deviation for the entire time epoch, which would contain real events). For each 
iteration of the analysis, if |cut-off — 30| <tolerance, the program terminates 
(where tolerance = 0.02). If cut-off >30, the program increases the cut-off by 
10% and goes back to step 1. If cut-off <3a, it reduces the cut-off by 10% and goes 
back to step 1. This approach helped ensure that neuronal activity-generated 
events in AF/F are not included in the estimation of noise and avoided the need 
for manually selecting epoch intervals on a cell-by-cell basis that did not contain 
an event in order to estimate noise. 

Subsequently, we analysed positive- and negative-going transients to fur- 
ther determine the false positive rate. Transient onsets are defined as the times 
when the AF/F exceeds 26 and offset is defined as the time at when a given 
transient falls below 0.5 a°. A histogram of the number of transients that exist 
for each o threshold value (that is, >20, >30, >40), for various durations, is 
extracted, where negative-going transients are to the left of the ordinate and 
plotted in red (Extended Data Fig. 4c—f). The ratio of the number of negative to 
positive going transients is calculated for different transient durations across 
three amplitude levels (20, 30, 4c), and serves as our estimate of false positive 
rate. Following from the reasoning described earlier, this ratio will be 50% 
when the motion-based noise significantly exceeds the signal. We plot the false 
positive ratio for the different scenarios described earlier, and choose the 
amplitude (in o) and duration cut off (Extended Data Fig. 4c-f) needed to 
reduce the false positive rate to below 5%. As mentioned previously’, it is 
important to note that this estimate of noise represents an upper bound, 
and could be influenced by other sources of noise apart from motion (that 
is, photon shot noise). 

Calculation of correlation coefficients between neuron pairs. The Pearson correla- 
tion coefficient was calculated between each pair of cells, c, and c,, as 


_ E((ca—Ca)(Cy —&)) 
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This metric measures linear dependence between signals in the two cells, and is 
invariant with respect to scaling or amplitude translation of the cell signals. We 
define a matrix of correlation coefficients of size Nees X Nees Wherein each entry 
corresponds to correlation between the cells identified by the corresponding row 
and column. To avoid accumulation in correlated signal due to slow drifts (for 
example, the long decay curve of GCaMP6m), we set all AF/F values lying outside 
the window of a significant transient (as defined earlier) to 0. 

Post-processing. Histogram of cell activity correlations. The property of high 
correlation (HC) was tested for in each neuron by finding the number of 
correlated neurons with which the Pearson’s correlation coefficient was above 
0.3 (a Pearson correlation cut-off of 0.3 was used as a conservative estimate of 
connectivity since previous studies using in vivo two-photon calcium imaging 
followed by paired whole-cell recordings reported a greater than 50% chance 
of connectivity when correlations of Ca”* signals exceeded 0.3 in vivo)®**. 
Histograms were obtained by binning this number across neurons in steps of 5 
and calculating the number of neurons that fell into each bin, with the resulting 
histogram representing the degree distribution of all neurons in the network. 


HC neurons were defined as those neurons that had more correlated partners 
than that of the average neuron in the same volume by >1 standard deviation. 
Optimally separating hyperplane. To identify network population activity mea- 
sures that best distinguished fear and neutral contexts, we used a space of graph 
theoretic parameters (described later), which together can be used to define an 
optimally separating hyperplane between the two contexts. Mathematically, this 
is posed as a constrained optimization problem, with the objective function 
seeking to maximize the sum of distances of the hyperplane to the nearest data 
points in each context, and the constraint being that the hyperplane separates the 
two contexts. This constrained optimization problem was solved using Lagrange 
multipliers. 

Synchrony and quantification of lead-lag. To analyse the spontaneous activity of 
the entire network, we computed the onset and duration of each activity transient 
(where event onsets and offsets are calculated as described earlier) for each 
neuron, and then combined transients from all cells into raster plots and col- 
lapsed these raster plots into activity histograms, which indicated the percentage 
of active cells as a function of time. 

To identify epochs of synchronous activity that included more active cells than 
would be expected by chance at each frame, we used interval reshuffling (ran- 
domly reordering of intervals between events for each cell), performed 1,000 
times for each mouse in each context, such that a surrogate histogram was con- 
structed for each reshuffling. The threshold percentage of active neurons corres- 
ponding to a significance level of P< 0.05 (appearing only in 5% of histograms) 
was taken to be the per cent of coactive cells required in a single frame to be 
considered a synchronous event, and this threshold ranged between 2.5% and 5% 
active neurons per frame across all mice and fields of view. At least three con- 
secutive frames with activity above the significance threshold were required to be 
considered a synchronous event, and all subsequent contiguous frames above this 
threshold were grouped together into the same synchronous event. To plot the 
cumulative distribution function of event onsets for HC and non-HC neurons 
during synchronous events, all synchronous events across all mice were iden- 
tified, and the onset times of HC versus non-HC neurons were binned per frame 
and plotted cumulatively as a function of the percentage of time elapsed during 
the synchrony window. 

To quantify whether the activity of HC neurons was leading or lagging 
their correlated pairs, the event onset of the HC neurons (defined as the first 
instance when the signal exceeded 3a for time consecutive frames) was first fixed 
at t = 0. The event onsets of all correlated pairs were then binned into 0.167 s time 
windows immediately preceding or succeeding the onset of the hub neuron 
at t=0. 

PCA. PCA was used to describe and visualize population activity of all neurons 


AF 
over time in each context. This was done by transforming the = of each cell 


(typically ~500 cells per mouse per context), over all time points in a given 
context (typically 1,800 frames) to a different coordinate system characterized 
by linearly independent eigenvectors, where each eigenvector represents a 
weighted combination of the different cells. Eigenvalues were sorted in decreasing 
order to reveal the most energetic (contributory) eigenvectors as well as the 
magnitude of their overall contribution. PCA was performed using eigenvalue 
decomposition of the correlation matrix. The corresponding eigenvalues and 
eigenvectors were calculated using custom MATLAB scripts. 

Estimation of graph-theoretic parameters. An undirected graph is defined based 
on the cell correlations in the population. An edge, E, is defined between neurons 
if they are correlated beyond the threshold described earlier. The undirected 
neuronal graph G (V, E) is defined using all the cells, which are denoted by V 
(vertices), and E (edges). Mean and maximum cell correlations are calculated 
using aggregate average and the maxima over all of the measured correlations, We 
also fit an exponential distribution between nor; (the number of correlations) and 
Nfreq (the degree distribution described earlier) to quantify how closely the graph 
mimics small world networks, which are characterized by a power law degree 
distribution 


~b 
Nfteq = (Morr) 


for which the power law parameters, a and b, are calculated by transforming 
the above equation into a logarithmic scale and performing a minimum least- 
squares fit. 

A neighbourhood is defined for each cell as a sphere of radius 30 jim. The 
clustering coefficient for a vertex is defined as the ratio of number of edges within 
its neighbourhood to the maximum number of connections possible. If there are 
k nodes in the neighbourhood, k(k — 1)/2 is the maximum number of possible 
connections’. The clustering coefficient of the entire network is defined as 
the mean clustering coefficient across all vertices. The mean path length is defined 
as the average path between any two randomly selected vertices of the graph. The 
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mean path length (mpl) is calculated by first constructing an adjacency matrix, 
which is an Me X Mey Matrix, and all correlated vertex pairs are given a value of 
one in the corresponding row and column, and zero otherwise. The minimum 
path from i to j can be recursively calculated using 


mpl; =min(mpl;,, + mplx;) 


Small-world networks are characterized by high clustering coefficient and low 
mean path length, quantified using the ratio of clustering coefficient to mean path 
length, where each term is normalized to a purely random graph with the same 
number of vertices. Betweenness centrality is a measure of the centrality of nodes 
in the network, and indicates how central a node is to communication between all 
pairs of node. Betweenness centrality is computed by calculating all possible paths 
between two nodes and calculating the number of those that pass through a given 
node. Strength of a graph quantifies how strongly different subcomponents of a 
graph are connected and is a measure of resistance of the graph to attack on its 
edges. Let P= (Vj, V2... V;,) denote all possible partitions of the graph G into a 
mutually exclusive set of vertices V,, V2... V,, such that the union of all the 
vertices is V. Let E, denote the number of edges that needs to be removed from 


E, 


G to create the partition P. Then the strength is defined as s= min ? where 


n— 
the minima are calculated over all possible partitions P. In other words, the 
strength quantifies how to remove minimal edges to create maximal separation 
among vertices of the graph. The strength is calculated using MATLAB code 
based on algorithms described previously”. 

Fast non-negative deconvolution algorithm implementation. Deconvolution 
algorithms enable the estimation of spike rate trains from fluorescence data. Here, 
we use deconvolution to estimate activity-event onset, not to detect single spikes, 
since GCaMP6m is assumed to neither have the linear response kinetics nor the 
sensitivity needed to detect single spikes from bursting neurons in the hippocam- 
pus. We used this analysis to help confirm our main results regarding synchron- 
ous events and timing of highly correlated neurons, since these analyses offer an 
alternative method to identify event onsets, while helping to remove noise (for 
example, long Ca’* signal decays) from the analyses. 

Many deconvolution algorithms exist. Early methods to deconvolve fluor- 
escence data used either thresholding to infer event onset” or optimizations to 
match a chosen spike profile. More robust algorithms such as the Wiener linear 
filter are promising®' but with practical value diminished since negative-going 
spikes are allowed. In 2010, Vogelstein and colleagues provided a fast non-nega- 
tive deconvolution method that is, in addition to imposing a non-negative con- 
straint on the spike trains, scalable on a large population of neurons*. Since our 
imaging involves hundreds of neurons over multiple contexts and days, we use 
the algorithm from Vogelstein et al. to deconvolve fluorescence signals. 

Three classes of parameters were optimized in this algorithm to fit data: (1) 
GCaMP-related parameters, namely sensitivity of fluorescence to elevations in 
intracellular Ca”* concentration (a) and baseline concentration (f); (2) acquisi- 
tion parameters, namely the size of the time bin (6) and the noise (a) in the AF/F 
signals; and (3) system (hippocampus/CA3)-related parameters, namely 
expected spike rate per second (g) and the time constant (zt), or the length it takes 
for Ca** concentrations to decay. For the GCaMP-related parameters, f is set to 
the baseline of the AF/F traces as described earlier, and « is set to 1 as a default 
value (since varying o broadly around this value did not affect the deconvolution 
results). 6 was set to 1/3 s because image acquisition was at least 3 Hz per optical 
slice, and o was estimated to be 0.16 as explained earlier. The main challenge 
resides in choosing parameters for y and t since (1) the expected spike rate (close 
to 0.1 Hz on average, but >10 Hz when bursting) is bimodal and insufficiently 
captured by the Poisson distribution of spikes as assumed by this model, and (2) 
the time constant expected for Ca* signals in hippocampus is not fully under- 
stood. Therefore, we optimized these two parameters by iterating over multiple 
combinations of time constants and expected spike rates to yield spike events 
consistent with good fits to our data (Extended Data Fig. 8). The final parameters 
chosen were: % = 1; f = baseline; 6 = 0.33 s; 6 = 0.16; gp =5 Hz; t=2s. These 
values were not exactly the same as, but were comparable to, values reported by 
others in cortical regions*°***. Importantly, varying g and t within a fairly broad 
range (y ~ 5-10 and t ~ 0.67-2) did not significantly alter the main conclusions 
of the subsequent analyses. The AF/F signals for all the mice, contexts and days 
were deconvolved. Correlation coefficients were calculated on the deconvolved 
signals, and metrics that rely on accurate estimate of event onsets were recom- 
puted, such as synchrony, lead-lag, and identification of HC neurons. The only 
difference from the methods described earlier was that there were no additional 
noise filters since the noise is filtered in the process of finding the optimal spike 
rate (here, event rate), and the onset time was characterized by the first instance 
that the signal became non-zero. Further analysis of the various specific decon- 
volution parameters would be of interest but would probably require combined 
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in vivo imaging and single-cell patching experiments, beyond the scope of the 
current study, and unlikely to significantly affect the specific analyses applied here 
given the robustness of results to broad ranges of parameters. Furthermore, we 
performed the analyses described earlier only to help ensure robustness in results 
obtained from using the raw AF/F for measurements relying on precise timing 
(correlations, leading versus lagging, and synchrony). 

Virtual reality behavioural analysis. Lick rates and movements on the ball were 
captured in XML log files storing timestamps of behavioural data. These were 
then parsed with custom Python scripts and imported into MATLAB for syn- 
chronizing with microscope imaging frames with kHz precision, and for sub- 
sequent analysis. To quantify differences in licking between fear and neutral 
contexts during retrieval, the number of licks per second (each lick causing a 
beam-break resulting in TTL pulse output of at least 1V), was integrated over the 
first 2 min in the context. Total licking amounts were normalized to the highest 
lick rate, observed from any mouse in any context, and presented as a fraction of 
this value for each mouse and each context. Lick suppression data are presented as 
mean values across all mice in each experimental group; significance values of 
differences between contexts were evaluated by Student’s t-test. Lick rates during 
optogenetic stimulation experiments were scored by quantification during the 15s 
of light delivery, which was then normalized to the corresponding value from the 
15 s just before light delivery. Significant differences in licking for fear versus neutral 
context, and for neutral/stimulated versus neutral context alone, were evaluated 
using Student’s t-test. Lick rates and velocity on ball during synchronous popu- 
lation activity events were calculated by comparing the amount of licking and 
distance travelled in the 5 s window beginning at the start of a synchronous event, 
and then normalizing to the amount of licking and distance travelled in the 5s 
window before synchronous event. Similar quantitative results were observed with 
this time window set to 1-10s after synchrony compared with before, with no 
significant difference in lick rate and velocity during versus before synchrony. 
Simultaneous 1P in vivo stimulation and 2P in vivo imaging. Simultaneous 
1-photon (1P) stimulation (594 nm) and 2-photon (2P) imaging (920 nm) was 
performed by injecting the new red-shifted opsin with improved trafficking and 
kinetics (bReaChES) via AAV8 in cingulate and GCAMP6m via AAVdj in CA3, 
and positioning a cranial window above CA2/CA3 for optical access. The 2P 
imaging and full-field optogenetic stimulation setup is shown in Fig. 4b. 
Briefly, a resonant galvanometer 2P microscope using an NIR pulsed laser set 
to 920 nm is combined with simultaneous, full field stimulation using a 594 nm 
continuous wave laser that is coupled into the system with an optical fibre, lenses 
and dichroic beam splitters. A 2P compatible NIR reflecting dichroic designed 
with an additional 594 nm band-pass filter was used for 1P yellow light stimu- 
lation during 2P imaging. GCaMP6m signals (green channel) and stimulation 
artefact (red channel—used to precisely blank stimulation time points) are 
recorded using standard 2P resonant scanning imaging. 1P stimulation artefacts 
were removed offline from the 2P images. Stimulation parameters: 591 nm light, 
20 Hz, 15 ms pulses, 15 s, 8-10 mW mm ” laser power at sample after the object- 
ive. In total, 4 mice (separate cohort from those used in the imaging-only experi- 
ments) were used for the combined stimulation and imaging experiments. The 
same cells and the same FOV are captured for before-training stimulation trials as 
well as after-training stimulation trials (conducted 5-7 days later). For a neuron 
to be considered responsive to (recruited by) the stimulus, at least one significant 
transient as defined earlier was required to occur during the stimulation window. 
For latency measurements provided in Fig. 4, event onsets were defined as the first 
time frame at which the response surpassed three standard deviations above 
noise, and increased for at least two consecutive frames; if occurring within the 
first frame, then only neurons with responses increasing from the previous frame 
are considered, to exclude responses decaying into the stimulation window. 
Responding neurons were assigned to latency bins of 333 ms. 

DSI electrophysiology. DSI is dependent on the increase of postsynaptic intra- 
cellular Ca** to suppress GABA release from presynaptic inhibitory neurons 
expressing cannabinoid receptors®’. We performed patch-clamp recordings from 
CA3 neurons expressing GCaMP6m and examined spontaneous inhibitory post- 
synaptic currents (sIPSCs) before and following a depolarizing pulse to induce 
Ca’* influx. Electrophysiological recordings were performed 4-6 weeks post- 
injection of AAVdj-CaMKIIa::GCaMP6m into CA3 (in 4-5-week-old mice). 
Coronal slices (300 fm) from injected mice were prepared after intracardial 
perfusion with ice-cold, sucrose-containing artificial cerebrospinal fluid solution 
(ACSF; in mM): 85 NaCl, 75 sucrose, 2.5 KCl, 25 glucose, 1.25 NaH,PO,, 4 
MgCh, 0.5 CaCl, and 24 NaHCO, . Slices recovered for 1 h at 32-34 °C, and then 
were transferred to an oxygenated recording ACSF solution (in mM): 123 NaCl, 3 
KCl, 26 NaHCOs, 2 CaCl, 1 MgCh, 1.25 NaH2PO, and 11 glucose, at room 
temperature. Excitatory synaptic transmission blockers (D-2-amino-5-phospho- 
novaleric acid (APV; 25 tM) and 2,3-dihydroxy-6-nitro-7-sulfamoyl-benzo[f]- 
quinoxaline-2,3-dione (NBQX; 10M)) were added to isolate GABAergic 
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postsynaptic currents, and 5 iM carbachol was used to enhance sIPSC frequency 
to facilitate detection of DSI. Recordings were performed at 32-34°C under 
constant perfusion of the oxygenated recording ACSF solution. Slices were visua- 
lized with an upright microscope (BX61WI, Olympus) under infrared differential 
interference contrast (IR-DIC) optics, and a Spectra X Light engine (Lumencor) 
was used for viewing GCaMP6m expression. Recordings of CA3 neurons were 
made after identifying GCaMP6m expression, and functional GCaMP6m activity 
was verifiable (in cells without BAPTA), by observing the increase in GCaMP6m 
fluorescence during the depolarizing pulse used to induce DSI. The following 
intracellular solution was used for the patch-clamp electrodes (in mM): 40 CsCl, 
90 K-Gluconate, 1.8 NaCl, 1.7 MgCl, 3.5 KCI, 10 HEPES, 2 MgATP, 0.4 Na,GTP, 
10 phosphocreatine (pH 7.2, 270-290mOsm). For the BAPTA experiments, 
40mM BAPTA was added to the intracellular solution. Series resistance was 
monitored for stability, and recordings were discarded if the series resistance 
changed significantly (by >20%) or reached 20 MQ. Resting membrane potential 
was taken at rest, and the reported values incorporate a liquid junction potential 
of +11.2 mV. Input resistance was calculated from a 100 pA pulse. MiniAnalysis 
(Synaptosoft) and pClamp10.3 (Molecular Devices) was used to calculate charge 
transfer (area under sIPSCs) and analyse data. Baseline charge transfer was mea- 
sured during a 4s pre-pulse period, DSI was examined during a 4s period fol- 
lowing the depolarizing pulse, and charge transfer after recovery from DSI was 
measured during a 4s window. The pulse used to evoke DSI was a 500 ms step to 
0 mV from holding potential of —65 mV. 

bReaChES design and testing 

Cloning and single mutagenesis of ReaChR and bReaChES. DNA sequences of 
ReachR and bReaCh were synthesized (GenScript) and cloned into AAV vectors 
containing the CamKIIo promoter for expression in neurons. All constructs were 
fused to eYFP DNA to detect protein expression in neurons by fluorescence micro- 
scopy. The Glu123Ser mutation was introduced using QuickChange Site-Directed 
mutagenesis kit (Agilent). Plasmid DNA was purified with QIAprep Spin Miniprep 
Kits (Qiagen) after transformation and amplification in Escherichia coli. 
Electrophysiological recordings in cultured hippocampal neurons. Electro- 
physiological recordings in neuronal cultures were prepared as described”. 
Patch pipettes (4-6 MQ) were pulled from glass capillaries (Sutter Instruments) 
with a horizontal puller (P-2000, Sutter Instruments) for whole-cell recordings in 
voltage and current clamp. Recordings were made using a MultiClamp700B 
amplifier (Molecular Devices). The external recording solution contained 
(in mM): 127 NaCl, 10 KCl, 10 HEPES, 2 CaCl,, 2 MgCl,, 30 p-glucose, pH 7.3, 
including synaptic blockers (25 1M D-APV, 10 14M NBQX). The patch pipette 
solution contained (inmM): 140 K-gluconate, 10 HEPES, 10 EGTA, 2 MgCl, 
pH7.3. All measurements were corrected for a liquid junction potential of 
+15 mV. Series resistance was monitored throughout recordings for stability. 
A Spectra X Light engine (Lumencor) was used to excite eYFP and to apply light 
for opsin activation. Yellow and red stimulation light was filtered by 575/25 or 
632/22 band-pass filters (Chroma) and applied through a X40 objective 
(Olympus) at 5mW mm * light intensity. Light power density was measured 
with a power meter (ThorLabs). The functionality of all constructs was deter- 
mined by comparing stationary photocurrents at — 80 mV to 1 s continuous light 
pulse. Spikes were optically evoked in current-clamp mode with light pulses 
(5ms) delivered at 633 nm, 5mW mm? and 1-20 Hz. The activation spectra 
for C1V1, bReaCH-ES and ChR2 was measured by recording stationary photo- 
current in voltage clamp mode at —80 mV and light intensities of 0.65 mW mm” 
at each wavelength. Light was delivered through 20nm band-pass filters 
(Thorlabs) at (in nm): 400, 420, 440, 460, 470, 480, 490, 500, 520, 540, 560, 
570, 580, 590, 600, 620, 630, 650. Photocurrents were normalized to maximum 
values respectively: 480 nm for ChR2, 560 nm for C1V1 and 570 nm for bReaCh- 
ES. Kinetics of channel closure were determined by fitting the decay of photo- 
currents after light off, with mono-exponential functions. Channel kinetics were 
quantified by corresponding Ts values respectively. pClamp10.3 (Molecular 
Devices) and OriginLab8 (OriginLab) software was used to record and 
analyse data. 

Stereotactic virus injection of bReaCh-ES. The following adeno-associated 
viruses (AAVs) with serotype DJ were produced at the Stanford Neuroscience 
Gene Vector and Virus Core: AAVDJ-CaMKII::bReach-E162S-TS-eYFP; and 
AAVDJ- CaMKII::C1V1(E122T/E162T)-TS-eYFP. Four-to-six-week-old mice 
were injected bilaterally with 1 ul of either virus in the medial prefrontal cortex, 
at the following coordinates (from Bregma): A/P: +1.7 mm; M/L: +0.3 mm; D/V: 
—2.5mm. Titre was matched at 1.5 X 10'? vg ml“! for both viruses. 


Slice electrophysiology for bReaCh-ES characterization in mPFC and BLA. 
Electrophysiological recordings were performed 12-14 weeks post-injection for 
opsin expression at the mPFC terminals. Coronal slices (300 um) from injected 
mice were prepared after intracardial perfusion with ice-cold, sucrose-containing 
ACSF (in mM): 85 NaCl, 75 sucrose, 2.5 KCl, 25 glucose, 1.25 NaH,PO,, 4 MgCl, 
0.5 CaCl, and 24 NaHCOs. Slices recovered for 1h at 32-34 °C, and then were 
transferred to an oxygenated recording ACSF solution (in mM): 123 NaCl, 3 KCl, 
26 NaHCO;, 2 CaCl, 1 MgCl, 1.25 NaH,PO, and 11 glucose, at room temper- 
ature. Electrophysiological recordings were performed at 32-34 °C under con- 
stant perfusion of the oxygenated recording ACSF solution. For mPFC 
recordings, synaptic transmission blockers (APV (25 uM), NBQX (10 4M) and 
gabazine (10 .M)) were added to the recording ACSF solution. Slices were visua- 
lized with an upright microscope (BX61WI, Olympus) under IR-DIC optics. A 
Spectra X Light engine (Lumencor) was used both for viewing fluorescent protein 
expression and delivering light pulses for opsin activation. Light power density 
was obtained with a power meter (Thorlabs). Recordings of mPFC neurons were 
made after first identifying regions of eYFP* expression, and recordings of post- 
synaptic basolateral amygdala (BLA) neurons were obtained after confirming 
eYFP* expression in both mPFC and the mPFC axonal fibres at the BLA. 
Whole-cell voltage-clamp recordings were performed at —65 mV, and current- 
clamp recordings were performed at rest. Patch-clamp pipettes contained the 
following internal solution (inmM): 125 K-gluconate, 10KCI, 10HEPES, 
4 Mg3-ATP, 0.3 Na-GTP, 10 phosphocreatine, 1 EGTA. Recordings were con- 
ducted using MultiClamp700B amplifier and pClamp10.3 software (Molecular 
Devices). pClamp10.3, OriginLab8 (OriginLab), and SigmaPlot (SPSS) were used 
to analyse data. Stationary photocurrent of the opsins was measured at the end of 
a 1s light pulse in voltage-clamp mode. Light-evoked spike probability in the 
mPFC neurons and in the postsynaptic BLA neurons was calculated as the frac- 
tion of successful action potentials evoked in the recorded neuron upon various 
light stimulation frequencies. Light-evoked EPSC amplitude in the postsynaptic 
BLA neurons was measured at the peak of the evoked response to light stimu- 
lation of the opsin-expressing mPFC fibres. Series resistance was monitored for 
stability, and recordings were discarded if series resistance changed significantly 
(by >20%) or reached 20 M®. Statistical analysis was performed with two-tailed 
t-test, with significance set at P<0.05. 
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Extended Data Figure 1 | Anatomical characterization of the AC-CA 
projection. a, Five days after RV-tdT injection into the hippocampus 
(coordinates specified), retrogradely labelled neurons were detected in the 
contralateral hippocampus (arrow), medial septum (bracket), and AC (Fig. 1a). 
Scale bars: <5: 300 um; X10: 100 [1m (confocal). b, Eight weeks after injection 
of CAV-Cre into the hippocampus and DIO-eYFP in the AC (coordinates 
specified), afferent cell bodies were detected in the AC (arrow). Confocal image; 


L & R hippocampus 5x 


RV - tdTomato 


L. Hippocampus 10x R. Hippocampus 10x 


X10 magnification; scale bar: 100 jum. c, Retrograde tracing with RV-tdT 
from the AC to map reciprocal connections from the hippocampus. The 
injection site is indicated in brackets, sparse labelling of afferent cell bodies 
in left and right hippocampus, primarily in the subiculum (arrows), and also in 
the medial dorsal thalamic nucleus as expected (asterisk). Confocal <5, scale 
bar, 200 jim; X10 images, scale bar 100 um. 
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Extended Data Figure 2 | Optogenetic manipulation of the AC-CA 
projection. a, Experimental design: RV-ChR2-eYFP (or eYFP alone) was 
injected into the dorsal hippocampus and light was delivered above the cell 
bodies in the AC. Five days after injection, ChR2 and eYFP mice were fear 
conditioned in context A while no-shock controls were only exposed to context 
A (day 1). All mice were tested with light on and off sessions in context B 
(day 2), and then tested for contextual memory retrieval in context A (day 3). 
Optogenetic stimulation was with 473 nm light in a train of 20 Hz, 15 ms pulses, 
30s duration, with 8-10 mW laser power at fibre tip. b, Freezing (no head 
motion observed) during day 2 is plotted in 5 s time bins over 150 s in context B 
(left). ChR2/shock (FC): black; ChR2/no shock (NS): red; eYFP/shock (eYFP): 
blue. Individuals in the FC group (each animal a different colour) are shown 
(middle). Summary (right): percentage time freezing (mean = s.d.) 20 s before 
light on (darker shade) versus 20 s after light on (lighter shade); FC: 60.9 + 7.4% 
light on versus 6.5 + 4.4% light off, n = 8; NS: 2.7 + 0.65% light on versus 
3.4 + 0.95% light off, n = 6; eYFP: 2.9 + 0.75% light on versus 3.6 + 1% light 
off, n = 6; P< 0.001, two-way ANOVA with repeated measures. c, Preservation 
of contextual fear memory (percentage time freezing) on day 3 in the 
original context (mean + s.d., P< 0.001, unpaired t-test). d, Preservation of 
contextual memory in medial septum injected mice (Fig. 1f); percentage 

time freezing on day 3 in the original context (mean ~ s.d., P< 0.001, 
comparisons shown, unpaired t-test). e, Preservation of contextual memory in 
hippocampus injected mice (Fig. 1h); percentage time freezing on day 3 in 
the original context (mean + s.d., n = 8 mice, P< 0.001, paired t-test). f, The 
successful loss-of-function experiments targeting hippocampus-dependent 
memory formation mediated by cells giving rise to the AC-CA projection 
(reported in Fig. 2) were designed to allow the most robust inhibition of this 
circuit element. An alternative design (attempting to target the projection 
field despite the broad and long septotemporal extent of the hippocampal 
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formation) was also explored as shown here but was not effective, as expected; 
we injected AAV5-eNpHR3.0-eYFP (or AAV5-eYFP in a parallel cohort) 
bilaterally into the AC, and targeted light stimulation bilaterally to axon 
terminals in the hippocampus. Eight weeks after injection, all mice were fear 
conditioned to context (day 1), and tested for context retrieval during light 
on/off sessions (day 2), and again for context retrieval in light-off only (day 3). 
Optogenetic inhibition was with constant illumination of 589 nm light, 30s 
duration, with 8-10 mW laser power at fibre tip. g, We observed a trend 
towards reduction in freezing due to optical inhibition of the AC-CA 
projection during memory retrieval. Percentage time freezing in context A 
during day 2 before light (darker bar on left) versus after light (lighter bar at 
right). eNpHR3.0: 73.5 + 8.5% light off versus 55.5 + 11.4% light on, n = 10; 
eYFP: 74 + 7.4% light off versus 74 + 11.3% light on, n = 10; percentage time 
freezing in context A with light off (dark bars) during day 3 is shown after 
dotted line. eNpHR3.0: 67.5 = 7.2%, n = 10; eYFP: 66.5 + 9.1%, n = 10 

(P = 0.067, two-way ANOVA). As expected, point illumination may be less 
effective for inhibiting broad axon terminal field volumes. h, Extension of 
findings from effective loss-of-function experiments (Fig. 2) targeting 
hippocampus-dependent memory formation mediated by cells giving rise 

to the AC-CA projection: significant effect on speed of onset of memory 
expression. Experimental design: CAV-Cre was injected into the dorsal 
hippocampus, DIO-eNpHR3.0 (or DIO-eYFP) was injected into the AC, 

and light was delivered above cell bodies in the AC. All mice were fear 
conditioned in context A (day 1), tested for latency to contextual retrieval with 
light-on only (day 2), and then for latency to context retrieval in light-off 
only (day 3). i, Day 2: 66.1 + 18.1 s for eNpHR3.0 (m = 12) versus 43.8 + 11.15 
for eYFP (n = 8) during light on; day 3: 53.8 + 13.7 s for eNpHR3.0 versus 
48.8 + 7.7 s for eYFP during light off; P < 0.05 two-way ANOVA with repeated 
measures. *P < 0.05, **P< 0.01, ***P< 0.001. 
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Extended Data Figure 3 | Physiological properties of GCaMP6m-expressing 
CA3 neurons. a, To ensure that expression of GCaMP6m did not alter Gat 
related physiological processes, we tracked a form of endocannabinoid- 
mediated short-term plasticity known as depolarization-induced suppression 
of inhibition (DSI). Schematic diagram of DSI is shown; DSI is dependent on 
the increase of postsynaptic intracellular Ca’* to trigger the synthesis and 
release of endocannabinoids, which then signal in a retrograde fashion to 
suppress GABA release from presynaptic inhibitory neurons expressing 
cannabinoid receptors (adapted with permission from ref. 65). Intrinsic 
membrane properties of the GCaMP6m-expressing CA3 cells were similar to 
previously reported values for CA3 (ref. 66); mean resting potential: 

—72.1 = 1.6mV; mean input resistance: 161.8 + 26.4 MQ; n = 7. b, Sample 
trace illustrating DSI of sIPSCs in a GCaMP6m-expressing CA3 cell following 
application of a depolarizing current step (from —65 mV to 0 mV for 500 ms). 
c, Sample trace illustrating lack of DSI of sIPSCs with inclusion of the 
intracellular calcium chelator BAPTA (1,2-bis(o-aminophenoxy)ethane- 
N,N,N’,N’-tetraacetic acid) in the patch pipette. d, Summary graph of 
normalized charge transfer in GCaMP6m-expressing cells with standard 
intracellular solution (left, normalized charge transfer of sIPSCs following DSI 


Day 


compared with pre-pulse baseline over the same fixed interval: charge reduced 
to 46.9 + 6.7% of baseline charge; n = 7; comparable to charge transfer 
reported for non-GCaMP-expressing cells®’) and with addition of intracellular 
BAPTA (right, m = 6; error bars represent standard error of the mean (s.e.m.); 
P<0.05, paired t-test). e, Spontaneous event rate (detection described in 
Methods) of GCaMP6m-expressing neurons as a function of baseline 
GCaMP6m fluorescence intensity (arbitrary units spanning the range over 
which event-rate population data could be reliably quantified) in each cell 
(pooling all neurons with =1 significant transient, from all mice, over all fields 
of view). Event rates were not observed to change significantly as a function of 
GCaMP6m expression level (Spearman’s rank correlation coefficient: 0.48, 
P=0.3). f, Behavioural scores from mice before GCaMP6m virus injection 
and implantation of cannulae above hippocampus; lick rates for the first 2 min 
in the fear (black) versus neutral (grey) contexts are provided. The level of 
learning assessed by lick suppression on day 3 retrieval (mean 0.5 + 0.3 for day 
3 fear versus 2.7 + 0.3 for day 3 neutral; n = 10, P< 0.001, paired t-test) 
pre-injection/implantation was comparable to levels corresponding to post- 
injection/implantation (compare with Fig. 3b). *P < 0.05, **P < 0.01, 

***P < 0.001. 
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Extended Data Figure 4 | Real-time imaging of neural ensembles in three- 
dimensional hippocampal volumes: extraction of neural sources and 
identification of significant transients. a, Head-fixed virtual reality setup. 
Mice run on an axially fixed track ball*' while movements and licking behaviour 
are measured through an optical mouse and a lickometer, respectively, both 
interfaced with the virtual-reality gaming software. For contextual fear 
conditioning, water-restricted mice were exposed to two contexts with 
distinct visual, olfactory, tactile and auditory cues (day 1), and provided with 
aversive air puffs in one context (fear context), but not the other (neutral 
context) (day 2). Fear memory retrieval in the two contexts was quantified (days 
3, 30) by lick suppression. b, Sample mean intensity z projections from raw 
videos (scale bar: 50 jm), with extracted neural sources (segmented cells) 
from CA3 for each of the optical sections, along with the first 50 time-series 
traces. Scale: 300% AF/F, 30 s. ¢, Identification of significant transients in AF/F 


traces. Histogram showing the distribution of events occurring at amplitude 20 
above noise (noise calculated on a per-cell basis), over a range of event duration 
in seconds. The number of negative-going transients at each amplitude and 
duration are plotted in red to the left of the ordinate, and positive-going 
transients at each amplitude and duration are plotted in blue to the right. 

d, e, The above analysis is repeated for events that occur at an amplitude of 
3a (d) and 4a (e). f, False positive rates for 2-, 3-, and 4-¢ events (pooled 
across all neurons in all mice over all FOVs). False positive rate curves were 
calculated for each a level by dividing the number of negative events at that level 
by the number of positive events at that level (Methods). Event onset was 
defined as the time corresponding to AF/F exceeding 2c, and offset as the time 
corresponding to AF/F falling below 0.50. A decaying exponential was fit by 
least-squares to the false positive rate values, allowing for the determination ofa 
minimum transient duration at each a level for different confidence levels. 
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Extended Data Figure 5 | Cell populations and graph properties of fear 
and neutral networks in hippocampus during retrieval. a, No context- 
dependent change in total event amplitude or rate was detected. Top, mean 
GCaMP6m-detected event amplitude (average AF/F of all significant events; 
definition of significant event for each neuron as described in Methods) is 
plotted across days for mice in the fear and neutral contexts (n = 5 mice, not 
significant in paired t-tests). Bottom, mean GCaMP6m-detected event rate 
plotted across days for mice in the fear and neutral contexts (n = 5 mice, not 
significant in paired t-tests). b, Context-dependent changes in individual- 
neuron and correlated behaviour were observed. Top, number of active 
neurons (at least one significant GCaMP6m transient detected within first 2 
min in context) plotted for fear and neutral contexts (m = 5 mice, 378 + 64 for 
day 3 fear context versus 257 + 39 for day 3 neutral context; P< 0.05, paired 
t-test, mean + s.d.). Bottom, mean number of correlated pairs per neuron 
(where pairwise Pearson’s correlation coefficient >0.3) plotted for fear and 
neutral contexts (n = 5 mice, 18.5 + 1.8 for day 2 fear context versus 13.4 + 1.4 
for day 2 neutral context; 11.3 + 0.8 for day 3 fear context versus 13.6 + 0.5 
for day 3 neutral context; P < 0.05, paired t-test). c, Fitting histograms from 
Fig. 3d to an exponential distribution of the form ae ** demonstrates a power- 
law (b>1) distribution in day 3 fear context (each red dot represents one 
mouse) compared to day 3 neutral context (b < 1; green dots), which was 
consistent across all mice (n = 5 mice; P< 0.01, paired f-test). Many graph 


properties were calculated for fear versus neutral context, but the power-law 
exponent of the degree distribution distinguished fear (red) from neutral 
(green) most powerfully (discriminants shown: coefficient of the power law 
exponent = 0.78, coefficient of cluster coefficient = 0.61, coefficient of mean 
path length = 0.11, with 90% confidence intervals being [0.74, 1.0], [0.1, 0.65] 
and [0.01, 0.23], respectively). These confidence intervals were obtained 
using 1,000 bootstrapped samples; shown is the best three-dimensional 
hyperplane separation using a linear support vector machine classifier. 

d, Histograms of the number of correlated partner neurons existing for each 
neuron in fear versus neutral context on day 3 (retrieval testing) across mice. 
The dotted red line indicates correlation threshold (set automatically as 
mean + 1 standard deviation in the number of correlated pairs in the network), 
to the right of which lie (by definition) the highly correlated or HC neurons. 
Similar measurements of interest in fear versus neutral context across mice 
were calculated and are provided here for other graph invariant properties: 

e, betweenness centrality; f, g, clique properties; h, strength; i, cluster coefficient; 
and j, mean path length (all defined in Methods). For the above calculations, 
correlation between two neurons was defined to exist when the pairwise 
Pearson correlation coefficient exceeded 0.3 (Methods). Data are presented as 
individual data points corresponding to each mouse, with mean = s.d. 

*P < 0.05, **P < 0.01, ***P < 0.001, using paired t-tests. 
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Extended Data Figure 6 | Functional relationships of fear-context-defined 
HC neurons as appearing in fear versus neutral context. a—c, Data from all 
additional mice (beyond the exemplar of Fig. 3e, f) demonstrating that HC 
neurons (red circles) in the fear context with a high degree of correlated 
partners (grey edges) when located in the neutral context have a much lower 
degree of correlated partners (n = 4 mice including the example in Fig. 3e, f); 


y (um) 


X (um) oe 600 
mean = 60 correlated pairs (standard deviation 19.4) in fear context versus 
mean = 18 correlated pairs (standard deviation 14.2) in neutral context; 
P<0.01 by paired t-test). Only four mice are analysed here because the exact 
same FOV (with cell identities) was not captured in fear versus neutral context 
for one mouse. 
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Extended Data Figure 7 | Spatial and temporal organization of HC neurons. 
a, Plot of mean pairwise correlation versus mean pairwise distance averaged 
over all FOVs (all days and all contexts) from all five mice. It was possible to 
detect a significant but weak relationship between mean correlation and 
distance (Spearman’s correlation = —0.66, P = 0.01), which could be a 
reflection of fine-scale spatial clustering as might be expected of recurrent 
circuits in CA3, but would also probably include residual crosstalk between 
regions of interest (ROIs) due to brain motion and common neuropil signal, 
which is expected and not significantly different from what has been previously 
observed in the hippocampus’. b, Plot of the number of correlated pairs 
versus pairwise distance for all neurons (black line), and HC neurons only 
(grey line). More correlated pairs were found at greater distances for HC 
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neurons (Spearman’s = 0.84, P = 0.002 for HC neurons; Spearman’s 
correlation = 0.23, P = 0.43 for all neurons). c, Cumulative distributions 
showing fraction of HC neurons (y-axis) with onset times at various latencies 
across the time course of synchronous events (x-axis) averaged across all mice, 
compared to response latencies of non-HC neurons. HC neuron activity 
appeared significantly earlier than for non-HC neurons during synchronous 
events (P< 0.001, Kolmogorov-Smirnov two-tail test, « = 0.664; note the 
horizontal resolution of the plot is inversely proportional to length of the 
synchrony window, and dependent on frame duration; for instance, a 10-s-long 
synchrony window with frame duration of 333 ms corresponds to a 3.33% 
resolution per frame). 
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Extended Data Figure 8 | Additional analyses: estimation of event onsets 
using fast non-negative deconvolution, and correlated pair analysis. 

a, Example pairings of the original GCaMP6m trace (top traces), with the 
deconvolved trace (bottom traces), shows reliable estimation of event onset 
from deconvolved data (deconvolution algorithm and parameters detailed 
further in Methods). Scale bar: 150% AF/F, 10 s. b, Original GCaMP6m traces 
from a representative synchronous event in one animal (left), paired with 

the deconvolved traces for that same synchronous event (right). Scale bar: 
300% AF/F, 10 s. c, The highest-degree node (neuron with the greatest number 
of correlated pairs) in the day 3 fear context had significantly more correlated 
pairs than the highest degree node in the day 3 neutral context, significant 


across n = 5 mice (58.8 versus 33.2 pairs, P< 0.01, paired t-test). d, Temporal 
relationship of HC neuron activity onset (set to time 0) compared with onset 
activity of correlated pairs (binned into 333 ms preceding or succeeding HC 
activity); n = 48 HC neurons. HCs were more likely to lead than lag their 
correlated pairs (58.5 + 20% leading versus 24.4 + 10% lagging; P< 0.01, 
unpaired t-test). e, Significant synchronous activity (defined in Methods) 
quantified across five mice: number of synchronous events in the fear context 
was significantly greater than in the neutral context (5.8 + 2.9 events in fear 
context versus 1.2 + 1.1 in neutral context; P< 0.01, paired t-test). 

*P < 0.05, **P<0.01, ***P < 0.001. 
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Extended Data Figure 9 | PCA of population trajectories in fear versus 
neutral contexts. a, PCA of AF/F traces of all active cells for mouse 2, 
performed separately for fear and neutral contexts. Population trajectories in 
the fear context take large, nearly orthogonal, deviations from the centre, while 
neutral context trajectories remain close to the origin. b, Three-dimensional 
reconstruction of the neuronal population showing that neurons participating 
in each synchronous event (red cells (t = 39), green circles (t = 92), blue circles 


t=210 


(t = 200)) are largely non-overlapping and anatomically homogenously 
distributed throughout the volume. There are, however, a small fraction of 
neurons that participate in all three events (black circles). c, d, Data are shown 
for another representative mouse. Similar results were seen in all other mice. 
e, The AF/F traces for a randomly selected set of 30 neurons participating in 
each of three events are shown, with the greatest amount of overlap seen 
between t = 65 and t = 210. Scale bars: 400% AF/F, 20 s. 
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Extended Data Figure 10 | bReaChES: engineering a red-shifted opsin for 
robust projection targeting. a, Schematics of ReaChR* and bReaChES. 
ReaChR is a hybrid of segments from channelrhopsin-1 (blue, amino acids 
1-95), Volvox channelrhodopsin-1 (red, amino acids 96-246, 279-350) and 
Volvox channelrhodopsin-2 (green, amino acids 247-278). The VChR1 
segment contains the point mutation Leu171Ile. ReaChR was modified here 
for enhanced expression and membrane trafficking as well as accelerated 
channel kinetics, resulting in bReaChES, as follows. The first 51 amino- 
terminal residues were replaced by the first 11 amino-terminal residues from 
channelrhodopsin-2 (yellow, amino acids 1-11) and the last 5 carboxy- 
terminal residues were removed. Mutation of Glu 123 to Ser increases speed 
of channel closure. b, Spectra of C1V1+7, bReaChES and ChR2 measured 
between 400 and 650 nm at 0.65 mW mm ” in cultured neurons from rat 
hippocampus (n = 6 each). ¢, Stationary photocurrents at 575 nm (C1V17r 
630 + 109 pA (s.e.m. throughout figure), ReaChR 963 + 113 pA, bReaChES 
1,365 + 128 pA) and 632 nm (C1V 177 315 + 111 pA, ReaChR 1,003 + 95 pA, 
bReaChES 841 + 102 pA). Current amplitudes were measured at —80 mV 


ARTICLE 


and 5 mW mm ~ light intensity, respectively. d, Speed of channel closure: 

t value of mono-exponential off-kinetics (C1V1 py 79 + 3.7 ms, n = 26; 
ReaChR 682 + 86 ms, n = 6; bReaChES 49 + 4.4 ms, n = 25; P< 0.0005). 

e, f, Representative current-clamp traces of ReaChR- or bReaChES-expressing 
cultured neurons excited with 633 nm light (5 ms, 5 mW mm”). ReaChR 
kinetics were slow enough that reliable action potential generation was only 
possible at very low frequencies (e), while the accelerated channel closure of 
bReaChES allowed reliable spike generation up to 20 Hz (f). g, h, Representative 
voltage-clamp (g) and current-clamp (h) traces of postsynaptic cells 
responding to light stimulation (orange) of bReaChES-expressing presynaptic 
terminals. Pulse length: 5 ms. i, j, Stationary photocurrents (i) and light-evoked 
spike probability (j) in opsin-expressing medial PFC (mPFC) cells in acute 
slice (C1V1 pr: n = 11, bReaChES: n = 10). k, 1, Light-evoked EPSC amplitude 
(k) and spike probability (1) in postsynaptic cells (C1V1 +r: n = 10, bReaChES: 
n= 18). Light wavelength 575 nm (25 nm bandwidth) and power density 
5mWmm ~. *P<0.05, **P< 0.01, ***P< 0.001. 
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Cleavage of GSDMD by inflammatory 
caspases determines pyroptotic cell death 


Jianjin Shib?*, Yue Zhao**, Kun Wang’, Xuyan Shi’, Yue Wang’, Huanwei Huang’, Yinghua Zhuang”, Tao Cai?, 
Fengchao Wang? & Feng Shao?** 


Inflammatory caspases (caspase-1, -4, -5 and -11) are critical for innate defences. Caspase-1 is activated by ligands of 
various canonical inflammasomes, and caspase-4, -5 and -11 directly recognize bacterial lipopolysaccharide, both of 
which trigger pyroptosis. Despite the crucial role in immunity and endotoxic shock, the mechanism for pyroptosis 
induction by inflammatory caspases is unknown. Here we identify gasdermin D (Gsdmd) by genome-wide clustered 
regularly interspaced palindromic repeat (CRISPR)-Cas9 nuclease screens of caspase-11- and caspase-1-mediated 
pyroptosis in mouse bone marrow macrophages. GSDMD-deficient cells resisted the induction of pyroptosis by 
cytosolic lipopolysaccharide and known canonical inflammasome ligands. Interleukin-1B release was also diminished 
in Gsdmd cells, despite intact processing by caspase-1. Caspase-1 and caspase-4/5/11 specifically cleaved the linker 
between the amino-terminal gasdermin-N and carboxy-terminal gasdermin-C domains in GSDMD, which was 
required and sufficient for pyroptosis. The cleavage released the intramolecular inhibition on the gasdermin-N 
domain that showed intrinsic pyroptosis-inducing activity. Other gasdermin family members were not cleaved by 
inflammatory caspases but shared the autoinhibition; gain-of-function mutations in Gsdma3 that cause alopecia and 
skin defects disrupted the autoinhibition, allowing its gasdermin-N domain to trigger pyroptosis. These findings offer 
insight into inflammasome-mediated immunity/diseases and also change our understanding of pyroptosis and 


programmed necrosis. 


Inflammatory caspases (caspase-1, murine caspase-11 and human 
caspase-4/5) are crucial for innate immune defences. Aberrant or 
excessive activation of caspase-1 causes or is associated with many 
autoinflammatory, autoimmune and even metabolic diseases’. 
Caspase-1 is activated by the canonical inflammasomes, in which a 
central scaffold, such as NLRP3, NLRP1, NAIP-NLRC4, AIM2 and 
Pyrin, detects its cognate ligand’. The NAIPs directly recognize fla- 
gellin, as well as the rod and needle components of bacterial type III 
secretion system (T3SS)*°. The newly identified Pyrin inflammasome 
indirectly senses bacterial modifications and inactivation of host Rho 
GTPases®. Caspase-11 detects cytosolic bacterial lipopolysaccharide 
(LPS), playing a critical role in endotoxic shock”*. Caspase-11 and 
its human counterparts caspase-4/5 are activated by direct LPS 
binding”””. 

Caspase-1 activation mainly occurs in macrophage/dendritic cells. 
Active caspase-1 processes and maturates interleukin (IL)-1B/18 and 
also triggers a form of programmed necrosis known as pyroptosis. 
Pyroptosis is the dominant response upon caspase-4/5/11 activation, 
occurring in both macrophage and non-macrophage cells". 
Pyroptosis features pore formation in the plasma membrane, cell 
swelling and rupture of the membrane, causing massive leakage of 
cytosolic contents. Accumulating evidences suggest a critical role of 
pyroptosis in immunity and disease’*. Pyroptosis of infected cells 
releases intracellular bacteria for neutrophil-mediated killing’. 
Inflammasome-mediated, but IL-1B/18-independent, clearance of 
infection has been noted with several intracellular bacteria’?"». 
Pyroptosis induced by the NLRP1B inflammasome or a gain-of- 
function mutation in Nirpla is the primary cause of anthrax-lethal- 
toxin-induced lung injury’® and haematopoietic progenitor cell 


depletion in mice’’, respectively. Caspase-1-mediated pyroptosis in 
HIV infection accounts for CD4* T-cell depletion’’, a critical event 
in HIV pathogenesis. Pyroptosis rather than cytokine secretion is prob- 
ably the key determinant of mouse lethality caused by systemic activa- 
tion of the NAIP-NLRC4 inflammasome or caspase-11 activation by 
excessive LPS”*", Despite these important functions, the mechanism 
underlying how inflammatory caspases trigger pyroptosis is unknown. 


GSDMD is required for LPS-induced pyroptosis 


We previously established an LPS electroporation protocol that can 
induce pyroptosis in more than 90% of LPS-stimulated cells’. This 
robust assay allowed for unbiased genome-wide genetic screen 
using the CRISPR-Cas9 technology to identify new components 
in LPS/caspase-4/11-induced pyroptosis. The screen was per- 
formed in Tir4-‘~ immortalized bone-marrow-derived macrophages 
(iBMDMs) that responded normally to LPS electroporation 
(Extended Data Fig. 1a). As expected, three out of the four Casp11- 
targeting guide RNAs (gRNAs) were recovered within the top 100 
hits. Notably, there was only one other gene gasdermin D (Gsdmd) 
that also had multiple gRNA hits; four out of the five Gsdmd-targeting 
gRNAs appeared in the top 30 hits, with two among the top 10 
(Fig. 1a). GSDMD is conserved in human and mouse with ~72% 
sequence similarity (Extended Data Fig. 2a). Small interference 
RNA (siRNA) knockdown in HeLa cells excluded other selected top 
hits but confirmed the requirement of GSDMD (Extended Data 
Fig. 1b-d, also see Methods). siRNA knockdown of Gsdmd in mouse 
iBMDMs also inhibited LPS-induced pyroptosis; the extent of inhibi- 
tion by three different siRNAs correlated with their knockdown effi- 
ciency (Extended Data Fig. le, f). 


1Peking University-Tsinghua University-National Institute of Biological Sciences Joint Graduate Program, School of Life Sciences, Tsinghua University, 100084, China. ?National Institute of Biological 
Sciences, Beijing 102206, China. ?National Laboratory of Biomacromolecules, Institute of Biophysics, Chinese Academy of Sciences, Beijing 100101, China. “National Institute of Biological Sciences, 


Beijing, Collaborative Innovation Center for Cancer Medicine, Beijing 102206, China. 
*These authors contributed equally to this work. 
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To further validate the role of GIDMD, GIDMD ‘~ HeLa cells and 
Gsdmd~'~ iBMDM cells were generated by CRISPR-Cas9-mediated 
targeting (Extended Data Fig. 3a). In both cells, the absence of 
GSDMD completely blocked LPS electroporation-triggered pyropto- 
sis, which was confirmed in multiple knockout clones (Extended Data 
Fig. 3b, c). Re-expression of human GSDMD in three GSDMD~'~ 
HeLa cell clones all restored LPS-induced pyroptosis (Fig. 1b). Mouse 
GSDMD showed the same rescue effect (Extended Data Fig. 3d). 
Similarly, Gsdmd~'~ iBMDM cells regained the sensitivity to LPS 
electroporation when complemented with either human or mouse 
GSDMD (Fig. 1c and Extended Data Fig. 3e). 


GSDMD is required for caspase-1-mediated pyroptosis 


We also performed a parallel CRISPR-Cas9 screen on caspase-1- 
mediated pyroptosis in iBMDM cells using the anthrax lethal factor 
N-terminal-domain-fused T3SS rod protein of Burkholderia thailan- 
densis (LFn-BsaK), the most potent agonist of the NAIP-NLRC4 
inflammasome*”’. Notably, the top hits recovered from the screen 
were dominated by gRNAs targeting all known components in the 
pathway, including Naip2, NIrc4 and Casp1, each of which was hit by 
multiple gRNAs (Extended Data Fig. 4a). The screen also hit Gsdmd 
which, along with seven other genes, had two independent gRNAs 
(Extended Data Fig. 4a). The requirement of Gsdmd, but not the 
seven other genes, for LFn—BsaK-induced pyroptosis was confirmed 
by siRNA knockdown assays (Extended Data Fig. le, f and 4b). 


"WT__AsipD__WT__AsipD _ (f-i) independent experiments. 
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Furthermore, Gsdmd~'~ iBMDM clones completely resisted LFn- 
BsaK-induced pyroptosis (Extended Data Fig. 3f), which was fully 
restored by exogenous expression of GIDMD (Fig. Ic). Gsdmd /~ 
iBMDMs were also unresponsive to stimulation by LFn-flagellin 
(FliC), an agonist of the NAIP5-NLRC4 inflammasome (Fig. 1d). 
Caspase-7 was previously shown to be cleaved by caspase-1 upon 
inflammasome activation by flagellin in Legionella pneumophila- 
infected mouse macrophages”'. We confirmed the cleavage of cas- 
pase-7/3 in LFn-BsaK-stimulated Gsdmd~'~ iBMDMs (Extended 
Data Fig. 4c). However, evident cleavage could be observed 6 h after 
stimulation, a time point when Gsdmd-proficient cells had already 
died of pyroptosis (therefore little caspase-3/7 cleavage occurred). 
This suggests that caspase-3/7 cleavage plays no role in caspase-1- 
mediated pyroptosis, but may induce apoptosis or have other func- 
tions in defence against L. pneumophila*'’. Thus, GIDMD plays a 
critical role in caspase-1-mediated pyroptosis downstream of the 
NAIP-NLRC4 inflammasome. 


GSDMD is only required for pyroptotic cell death 

We further examined the role of GSDMD in other canonical- 
inflammasome-triggered pyroptosis. Gsdmd‘~ completely blocked 
iBMDM cell pyroptosis upon activation of the NLRP3 and AIM2 
inflammasomes by LPS plus nigericin and poly(dA:dT), respectively 
(Fig. 1d, e). In contrast, Gsdmd~'~ did not affect extrinsic and intrinsic 
apoptosis triggered by TNFa plus cycloheximide (TNFa + CHX) and 
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ultraviolet irradiation, respectively (Fig. 1d). siRNA knockdown of 
GSDMD in HT-29 cells, though efficiently blocking LPS-induced pyr- 
optosis, had no effect on TSZ (TNFa plus SMAC mimetic and the 
caspase inhibitor zVAD)-induced necroptosis, another form of pro- 
grammed necrosis mediated by the RIPK1-RIPK3-MLKL axis” 
(Extended Data Fig. 5a-c). 

We generated GSDMD-deficient mice by CRISPR-Cas9-mediated 
targeting. Primary BMDMs from two independent homozygous 
mutant mice (F1-1 and F1-2) (Extended Data Fig. 6a) were analysed 
for inflammasome activation, and similar results were obtained. In the 
canonical inflammasome pathway, pyroptosis mediated by the NAIP2/ 
5-NLRC4, NLRP3 and AIM2 inflammasomes were all blocked by 
GSDMD deficiency (Fig. 1f). Gsdmd ‘~ also abolished pyroptosis upon 
Clostridium difficile toxin B activation of the Pyrin inflammasome’. 
Similarly, Gsdmd‘~ BMDMs resisted LPS-induced pyroptosis 
(Fig. 1f). When infected with Salmonella typhimurium, wild-type 
BMDMs showed extensive pyroptosis due to T3SS-dependent activa- 
tion of NAIP-NLRC4 inflammasome, but little pyroptosis occurred in 
BMDMs from Gsdmd~‘~ mice (Fig. 1g). Defective pyroptosis in 
Gsdmd‘~ BMDMs was not due to the absence of caspase-1/11 
expression (Extended Data Fig. 6b). Consistent with that in iBMDMs, 
primary Gsdmd '~ BMDMs remained sensitive to necroptosis induc- 
tion by TSZ or LPS plus zVAD (Extended Data Fig. 5d, e). These data 
highlight a specific role of GSDMD in caspase-4/11 and caspase-1- 
mediated pyroptosis downstream of multiple inflammasomes. 


GSDMD controls IL-1f release but not maturation 


IL-1 maturation and secretion is the other major response of 
canonical inflammasome activation. We observed that primary 
Gsdmd~'~ BMDMs, in contrast to wild-type BMDMs, secreted little 
IL-1 into the supernatant upon activation of the NAIP2/5-NLRC4, 
NLRP3, Pyrin and AIM2 inflammasomes (Fig. 1h). S. typhimurium- 
infection-induced IL-1 release was also inhibited in Gsdmd '~ 
BMDMs (Fig. li). Notably, caspase-1 autoprocessing occurred 
normally in Gsdmd~‘~ BMDMs upon activation of the above inflam- 
masomes while extracellular secretion of both pro-caspase-1 and the 
mature caspase-1 was severely inhibited (Extended Data Fig. 6c). 
Intact caspase-1 autoprocessing and IL-1f maturation were also 
observed in the cytosol of Gsdmd~‘~ iBMDMs upon canonical 
inflammasomes activation (Fig. 2a, b and Extended Data Fig. 7a). 
Thus, GSDMD-mediated pyroptosis plays an important role in 
mature IL-1 release without affecting its maturation. 


GSDMD is specifically cleaved by caspase-1/4/5/11 


The above analyses predict that GSDMD functions downstream of the 
inflammatory caspases. We observed that GSDMD was cleaved in 
LPS and LFn-BsaK-stimulated HeLa and iBMDM cells, respectively 
(Fig. 2c). A cleavage product (~38kDa) corresponding to the 
N-terminal half of 2 Flag-HA-GSDMD was constantly identified. 
LPS and LFn-BsaK-induced GSDMD cleavage was diminished by a 
pan-caspase inhibitor zVAD that also blocked cell death (Extended 
Data Fig. 7b). Co-expression of GSDMD with caspase-1, 4, 5 or 11 but 
not apoptotic caspases (caspase-2, 8 and 9) in 293T cells induced the 
same cleavage of GSDMD (Extended Data Fig. 7c). No GSDMD 
cleavage occurred in TNFa +CHX and TSZ-stimulated cells 
(Extended Data Fig. 5f), consistent with that GSDMD was not 
required for apoptosis and necroptosis (Fig. 1d and Extended 
Data Fig. 5c). 

We then tested whether GSDMD is a direct substrate of inflam- 
matory caspases. The active tetramer forms of caspase-1, 4 and 11, but 
not caspase-2, 8 and 9, were found capable of cleaving purified recom- 
binant GSDMD (no tag, ~53 kDa) (Fig. 2d). The larger-size cleavage 
product migrated at ~31 kDa, which plus the 7-kDa 2 Flag-HA tag 
equals the 38-kDa N-terminal cleavage product observed in cells 
(Fig. 2c). The other cleavage product (~22 kDa) matched the size of 
the remaining C-terminal fragment. Consistent with that caspase-4/11 
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Figure 2 | GSDMD is a substrate specifically for inflammatory caspases. 

a, b, Effects of Gsdmd knockout on intracellular processing of caspase-1 and IL- 
1 (stably expressed).p10, mature caspase-1; Pro-CASP1, caspase-1 precursor; 
Pro-IL-1B, IL-1B precursor; p17, mature IL-1. c, A specific cleavage in 
GSDMD induced by LPS electroporation and LFn-BsaK. GIDMD-FL, full- 
length GSDMD. d, In vitro cleavage of GIDMD by recombinant caspases. 
Purified GSDMD was incubated with active p20/p10 tetramers of indicated 
caspases (rCASPs). e, In vitro cleavage of GIDMD by LPS-activated caspase- 
11/4. Purified GSDMD was incubated with insect-cell-purified caspase-11/4 
pre-incubated with LPS or indicated ligands. The N-terminal sequence of 
GSDMD.-C determined by Edman sequencing is shown. f, Cartoon diagram 
of GSDMD structure and the cleavage by inflammatory caspases. LFn—BsaK 
and LPS + Nig. were used to activate the NAIP2-NLRC4 and NLRP3 
inflammasome, respectively (a-c). 2Flag-HA-GSDMD was stably expressed 
in HeLa and iBMDM cells (c). Reaction samples were analysed by SDS- 
PAGE and Coomassie blue staining (d, e). Cell lysates were subjected to anti- 
caspase-1 (a), anti-IL-1f (b), anti-Flag (c), anti-tubulin (a, c) and anti-actin 
(b) immunoblotting. All data shown are representative of three independent 
experiments. 


are activated by direct LPS binding’, insect-cell-purified caspase-11 
could process recombinant GSDMD into the 31kDa and 22kDa 
fragments only upon prior incubation with LPS or Lipid A but 
not muramyl dipeptide (MDP) (Fig. 2e). Identical results were 
observed with insect-cell-purified caspase-4. Edman sequencing of 
the C-terminal fragment revealed an N-terminal sequence of 
276GVPAEGpg; (Fig. 2e). Preceding 776GVPAEGp¢; is 272FLTD27; 
in human GSDMD and 273LLSD27. in mouse GSDMD (Fig. 2f), 
which fit well with the substrate recognition motif of inflammatory 
caspases”. 97xFLTD 75 is the only possible inflammatory-caspase 
cleavage site conserved in human and mouse GSDMD (Extended 
Data Fig. 2a). Thus, inflammatory caspases specifically cleave 
GSDMD after the 27.FLTD275 (or 273LLSD27¢) sequence (Fig. 2f). 


Cleavage of GSDMD is required for pyroptosis 

We generated a mutant GSDMD, in which Asp275 (Asp276 in mouse 
GSDMD) was replaced with an alanine. As expected, the D/A mutants 
of GSDMD resisted cleavage by caspase-1 and caspase-4/5/11 over- 
expressed in 293T cells (Extended Data Fig. 8a, b). Physiologically 
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activated caspase-1 by LFn-BsaK also failed to cleave GIDMD D275A 
complemented into the Gsdmd~'~ iBMDMs (Fig. 3a); importantly, 
the mutant was completely inactive in restoring LFn-BsaK-induced 
pyroptosis (Fig. 3b). GIDMD D275A also resisted cleavage by cas- 
pase-11 and caspase-4 in LPS-stimulated iB MDM and HeLa cells, 
respectively (Fig. 3a, c), and was unable to mediate LPS-induced 
pyroptosis (Fig. 3b, d). 

In bacterial infection assays, complementation of the Gsdmd~/~ 
iBMDMs with wild-type GSDMD restored S. typhimurium-induced 
pyroptosis but the D275A mutant showed no activity (Fig. 3e). 
While wild-type GSDMD was cleaved in the T3SS-dependent man- 
ner, the D275A mutant protein remained intact (Extended Data 
Fig. 8c). The same results were observed in B. thailandensis and enter- 
opathogenic Escherichia coli infections (Fig. 3f and Extended Data 
Fig. 8d). In HeLa cell infection, S. typhimurium AsifA triggered 
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Figure 3 | Cleavage of GSDMD by inflammatory caspases is required for 
pyroptosis. a-d, Assays of the cleavage-resistant mutant of GIDMD in 
caspase-1 and caspase-4/11-mediated pyroptosis induced by LFn-BsaK and 
LPS electroporation, respectively. e-g, Assays of the cleavage-resistant 
mutant of GSDMD in bacterial-infection-induced pyroptosis. iBMDM cells 
were infected with wild-type S. typhimurium (e), B. thailandensis or 
enteropathogenic E. coli (EPEC) (f) or their T3SS-deficient mutants (AsipD, 
AbipB and AescN, respectively) as controls. HeLa cells were infected 

with S. typhimurium AsifA (g). The wild-type or the D275A mutants (D/A) 
of 2X Flag-HA-GSDMD (a, c) and Flag-GSDMD (pb, d-g) were stably 
expressed in the knockout cells. Cell lysates were analysed by anti-Flag 
and anti-tubulin immunoblotting (a—d). Lysates of non-stimulated 

cells were analysed in b and d. ATP-based cell viability (b, d, e) and 
LDH-release-based cell death (f, g) are expressed as mean values + s.d. from 
three technical replicates. All data shown are representative of three 
independent experiments. 
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caspase-4-dependent pyroptosis””*. This cell death was diminished 
in GSDMD~‘~ cells (Fig. 3g). Notably, expression of wild-type 
GSDMD but not the D275A mutant in GSDMD~‘~ cells restored 
S. typhimurium AsifA-triggered pyroptosis (Fig. 3g). 


Cleavage of GSDMD is sufficient to drive pyroptosis 


To investigate whether cleavage at Asp275 in GSDMD is sufficient to 
initiate pyroptosis, a PreScission protease (PPase) recognition sequence 
(EVLFNGP) was inserted between Leu273 and Thr274 in the 
272FLTD275 motif in GSDMD. Expression of this engineered 
GSDMD(EVLENGP) mutant or wild-type GSDMD in 293T cells by 
itself triggered no pyroptosis. However, upon cytosolic delivery of puri- 
fied PPase, cells expressing GSDMD(EVLENGP) but not wild-type 
GSDMD died extensively (Fig. 4a). Consistently, GIDMD(EVLFNGP) 
but not the wild-type protein was cleaved into the expected size by the 
PPase (Fig. 4b). The dying cells developed typical pyroptosis morpho- 
logy with cell swelling and membrane rupture (Fig. 4a). We also 
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Figure 4 | Interdomain cleavage of GSDMD is sufficient to trigger 
pyroptosis owing to the intrinsic pyroptosis-inducing activity in its 
N-terminal domain. a, b, Pyroptosis induced by interdomain cleavage of an 
engineered GSDMD by PPase. 293T cells were transfected with Flag-tagged 
wild-type GSDMD or GSDMD-Mutl (insertion of the PPase recognition 
sequence EVLFQGP into the FLTD site). Purified PPase was electroporated 
into the cells. c, d, Switch of TNFa-induced apoptosis to pyroptosis by the 
caspase-3-sensitive GSDMD mutant. Flag-tagged wild-type GSDMD or 
GSDMD-Mut2 (replacement of the FLTD motif with the caspase-3-cleavage 
site DEVD) was stably expressed in HeLa GSDMD ‘~ cells. Cells were 
treated with TNFa + CHX for 8 h. e, f, The pyroptosis-inducing activity of 
GSDMD N-terminal domain. Indicated GSDMD fragments were transiently 
expressed in 293T cells. GIDMD-FL, full-length GSDMD; GSDMD-N and 
GSDMD-C, the N- and C-terminal product of GIDMD generated by 
inflammatory caspase cleavage, respectively. Phase-contrast images of cell 
morphology are shown in a, c and e. ATP-based cell viability is expressed as 
mean values + s.d. from three technical replicates (a, c, f) and are marked on 
the images in decimal form (a, c). Cell lysates were analysed by anti-Flag and 
anti-tubulin immunoblotting (b, d, f). All data shown are representative of 
three independent experiments. 
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engineered the FLTD site into DEVD, the cleavage site of caspase-3/7, 
and the resulting GSDMD(DEVD) mutant was stably expressed in 
GSDMD ‘~ Hela cells. When stimulated with TNF + CHX, both 
wild-type GSDMD and GSDMD(DEVD) mutant-expressing cells 
showed massive cell death (Fig. 4c). As expected, cells expressing wild- 
type GSDMD underwent apoptosis with no GSDMD cleavage. In con- 
trast, GIDMD(DEVD) mutant-expressing cells showed typical pyrop- 
totic morphology, in which the mutant GSDMD was proteolysed to the 
expected size by activated endogenous caspase-3/7 (Fig. 4c, d). The 
switch of TNFo-induced apoptosis to pyroptosis by the caspase-3/7- 
sensitive GIDMD(DEVD) can be observed in Supplementary Videos 
1 and 2. These results establish that proteolytic cleavage at Asp275 in 
GSDMD is sufficient to instruct mammalian cells to undergo pyroptosis. 


Pyroptosis-inducing activity of GIDMD-N domain 


The function of GIDMD is completely unknown. Gsdmd~‘~ mice 
have no obvious phenotypes with normal epithelial differentiation 
in the intestinal tract despite high expression in gastrointestinal 
tracts. To understand the functional mechanism of GSDMD 
cleavage in pyroptosis, the N-terminal and C-terminal cleavage 
fragments (GSDMD-N and GSDMD-C, respectively) were individu- 
ally expressed in 293T cells. In contrast to GIDMD-C and full-length 
GSDMD, GSDMD-N could cause extensive cell death with apparent 
pyroptosis morphology (Fig. 4e, f). GSDMD-N was expressed at a 
much lower level than GSDMD-C and full-length GSDMD, a phe- 
nomenon commonly seen in proteins with cytotoxicity. Progressive 
truncations identified residues 1-243 of GSDMD as the minimal 
fragment capable of triggering pyroptosis (Fig. 4f). 


Autoinhibition of GSDMD and the gasdermin family 


Above analyses suggest that GIDMD-C may interact with GSDMD- 
N and this autoinhibition is released upon interdomain cleavage by 
inflammatory caspases. Supporting this idea, GSDMD-C was effi- 
ciently co-immunoprecipitated by GSDMD-N in transfected 293T 
cells (Fig. 5a). Furthermore, overexpression of GSDMD-C could 
block LPS-induced pyroptosis in HeLa cells due to trans-inhibition 
of endogenous GSDMD-N generated from caspase-4 cleavage 
(Fig. 5b). 

GSDMD belongs to the gasdermin family that also contains 
GSDMA, GSDMB, GSDMC, DFNA5 and DFNB59 (Extended Data 
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Fig. 2b)”*’°. The gasdermin family shares ~ 45% overall sequence 
homology. Structures of the gasdermins can be divided into two 
domains, the gasdermin-N and -C domains, corresponding to the 
two cleavage fragments of GSDMD by inflammatory caspases 
(Fig. 2f). Mice lack GsdmB, but encode three GIDMA (GSDMA1- 
A3) and four GSDMC (GSDMC1-C4) proteins (Extended Data 
Fig. 2b). Other gasdermins do not share the FLTD motif in 
GSDMD (Extended Data Fig. 2a). Consistently, GIDMA, GSDMB 
and GSDMC resisted caspase-1/11 cleavage in 293T cells (Extended 
Data Fig. 9a). The gasdermin-N domain of GIDMA, GSDMB and 
GSDMC all could bind to its respective gasdermin-C domain (Fig. 5c), 
suggesting a similar autoinhibition that is released by mechanisms 
other than caspase cleavage. 


Pyroptosis-inducing activity of GSDMA3 in disease 


The function of the gasdermin family is poorly characterized**”®. 
GSDMAS3 is the most studied gasdermin; mice with spontaneous or 
chemically induced mutations in Gsdma3 exhibit hyperkeratosis and 
hair-loss phenotypes in the skin**””. The mechanism for this alopecia 
pathology is unknown, but severe chronic inflammation has been 
observed in the skin of Gsdma3-mutant mice”. Like GSDMD, 
GSDMA3 exhibited the intramolecular interaction between its gas- 
dermin-N and -C domains (Fig. 5c). Overexpression of the gasder- 
min-N domain of GSDMA3 (residues 1-284), but not full-length 
GSDMA3, caused extensive pyroptosis in 293T cells (Fig. 5d); 
GSDMA3 was not cleaved by inflammatory caspases but artificial 
interdomain cleavage of an engineered GSDMA3 by PPase could 
cause 293T cell pyroptosis (Extended Data Fig. 9b). GSDMA3 also 
remained intact in TNFo-induced apoptosis and necroptosis 
(Extended Data Fig. 9c). These data suggest that GSDMA3 is an 
autoinhibited pyroptosis-inducing factor functioning in an unknown 
biological process. The autoinhibition is partially supported by a 
recent intriguing study that proposes a function of GSDMA3 in 
autophagy”’. 

Gsdma3 '~ mice show no visible skin defects*. In fact, the hyper- 
keratosis and hair-loss phenotype in Gsdma3-mutant mice is dom- 
inant and the mutations are gain-of-function”. There are nine 
reported Gsdma3 mutant alleles, resulting in 259RDW (insertion after 
residue 259 with mistranslated RDW sequence), T278P, L343P, 
Y344C, Y344H, A348T, I359N, premature stop at 366, and duplica- 
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tion of E411A412 (412EA), in the gasdermin-C domain’. Five of 
the mutations (T278P, L343P, Y344C, A348T and 412EA) were ana- 
lysed and found capable of instructing 293T cells to undergo 
pyroptosis (Fig. 5e). These mutations all disrupted or attenuated the 
co-immunoprecipitation between the gasdermin-N and -C domains 
of GSDMA3 (Fig. 5f). Thus, the alopecia-causing GSDMA3 mutants 
lose the autoinhibition and become constitutively active in triggering 
pyroptosis. Taken all together, our analyses further suggest that gas- 
dermins are all pyroptosis-inducing factors that are activated by 
release of the autoinhibition on their gasdermin-N domains. 


Discussion 


We identify GSIDMD as a generic substrate for caspase-1 and caspase- 
4/5/11. Cleavage of GIDMD by the inflammatory caspases critically 
determines pyroptosis by releasing the cleaved gasdermin-N domain 
that bears intrinsic pyroptosis-inducing activity. Further understand- 
ing the mechanism of action of GIDMD may provide a new avenue 
for therapeutic intervention of inflammatory-caspase-associated 
autoinflammatory conditions and septic shock. We also show that 
other gasdermins have a similar pyroptosis function; importantly, this 
observation redefines the concept of pyroptosis as gasdermin- 
mediated programmed necrosis. 

Caspases are classified into apoptotic and inflammatory caspases. 
Substrate targeting by these caspases is known to be promiscuous to 
some extent; overlapping substrate spectra for caspases of different 
natures have been noted in proteomic efforts that identified GSDMD 
as a substrate for both caspase-1 and apoptotic caspases*’**. Our data 
suggest that interdomain cleavage of GSDMD by caspase-1/4/5/11 
determines pyroptosis. Caspase-1 can also cleave caspase-3/7”', which 
may have a pyroptosis-independent function, particularly in the 
absence of GSDMD. Thus, the substrate cleaved by a caspase in a 
cellular context is the determinant for the nature of the cell death. 
Pyroptosis and necroptosis are two types of lytic and programmed 
necrosis. In necroptosis, a pseudokinase MLKL, upon phosphoryla- 
tion by RIPK3, acts as the executioner to rupture the membrane”. It is 
unclear at this stage whether GSDMD plays a similar ‘executioner 
role in pyroptosis, but our data suggest that GSDMD, or more spe- 
cifically its gasdermin-N domain, is sufficient to drive pyroptosis 
regardless of the cellular system. Given the possible presence of other 
GSDMD- and MLKL-like necrosis factors, our results suggest a new 
paradigm for understanding programmed necrosis. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized and the investigators were not blinded to allocation during 
experiments and outcome assessment. 

Plasmids, antibodies and reagents. Complementary DNA (cDNA) for human 
GSDMD was amplified from reverse-transcribed cDNA of HT-29 cells; cDNAs 
for human GSDMB, human GSDMC and mouse Gsdma3 were synthesized by our 
in-house gene synthesis facility; cDNAs for human GSDMA and mouse Gsdmd 
were obtained from Vigene Biosciences (CH892815) and OriGene (MC202215), 
respectively. The gasdermin cDNAs were inserted into a modified pCS2-3 x Flag 
vector for transient expression in 293T cells and the pWPI lentiviral vector with 
an N-terminal 2x Flag—HA tag or the FUIGW vector with an N-terminal Flag tag 
for stable expression in HeLa and iBMDM cells. For recombinant expression in 
E. coli, the cDNAs were cloned into a modified pET vector with an N-terminal 
SUMO tag. Truncation mutants of the gasdermins were constructed by the 
standard PCR cloning strategy and inserted into the pCS2 vector with indicated 
tags. Expression plasmids for caspase-1, 4, 5 and 11 were previously described*’, 
the caspase-9 plasmid was a gift from X. Wang (National Institute of Biological 
Sciences, Beijing). cDNAs for human CASP2 and mouse Casp8 are from the Life 
Technologies Ultimate ORF collection and OriGene (MC200404), respectively. 
Point mutations were generated by the QuickChange Site-Directed Mutagenesis 
Kit (Stratagene). All plasmids were verified by DNA sequencing. 

Antibodies for caspase-1 p10 (sc-515), Myc epitope (sc-789) and GSDMD 
(sc-81868) were obtained from Santa Cruz Biotechnology. Other antibodies used 
in this study include anti-HA (MMS-101P, Covance), anti-Flag M2 (F4049), anti- 
actin (A2066) and anti-tubulin (T5168) (Sigma-Aldrich), rat monoclonal cas- 
pase-11 17D9 (NB120-10454, Novus Biologicals), anti-caspase-3 (#9662) and 
caspase-7 (#12827) (Cell Signaling Technology), IL-1 (3ZD; Biological 
Resources Branch, National Cancer Institute) and the antibody for detecting 
endogenous GSDMD (NBP2-33422, Novus Biologicals). Ultrapure LPS from 
E. coli O111:B4 and poly(dA:dT) were purchased from InvivoGen. LPS (L4524, 
for priming), TNFa and cycloheximide were purchased from Sigma-Aldrich. 
SMAC mimetic and the pan-caspase inhibitor zVAD are gifts from the laboratory 
of X. Wang (National Institute of Biological Sciences, Beijing). Nigericin was 
purchased from Calbiochem. Recombinant p20/p10 active caspase proteins (cas- 
pase-1/2/4/8/9) and lipid A (ALX-581-200-L001) were obtained from Enzo Life 
Sciences. Cell culture products are from Life technologies and all other chemicals 
used are Sigma-Aldrich products unless noted. 

Cell culture and transfection. HeLa, HT-29 and 293T cells were obtained from 
ATCC. C57BL/6 mice-derived wild-type and Tlr4'"iBMDM cells were kindly 
provided by K. A. Fitzgerald (University of Massachusetts Medical School, United 
States) and A. Ding (Weill Cornell Medical College, United States), respectively, 
and used in our previous studies*®**°. All the cell lines are well-established, com- 
monly used and frequently checked by virtue of their morphological features and 
functionalities, but have not been subjected to authentication by short tandem 
repeat (STR) profiling. All the cell lines have been tested to be mycoplasma- 
negative by the commonly used PCR method. iBMDM, HeLa and 293T cells 
were grown in Dulbecco’s modified Eagle’s medium (DMEM); HT-29 cells were 
grown in McCoy’s 5a modified medium. All media were supplemented with 10% 
(vol/vol) fetal bovine serum (FBS) and 2 mM L-glutamine. All cells were grown at 
37 °C in a 5% CO, incubator. Transient transfection of HeLa and 293T cells was 
performed using the JetPRIME (Polyplus Transfection) or Vigofect (Vigorous) 
reagents by following the manufacturers’ instructions. For stable expression, 
lentiviral plasmids harbouring the desired gene were first transfected into 293T 
cells together with the packing plasmids pSPAX2 and pMD2G with a ratio of 
5:3:2. The supernatants were collected 48 h after transfection and used to infect 
HeLa or iBMDM cells for another 48 h. GFP-positive infected cells were sorted by 
flow cytometry (BD Biosciences FACSAria II). For siRNA knockdown, 0.5 pl of 
20uM siRNA together with 0.81 of INTERFERin reagents (Polyplus 
Transfection) were used for reverse transfection of iBMDM cells in the 96-well 
plate format; 5 pil of 20 1M siRNA and 10 pl of INTERFERin reagents were used 
to transfect HeLa cells in the 6-well plate format. The knockdown was performed 
for 60h before subsequent analyses. The knockdown efficiency was assessed by 
quantitative real-time PCR (qRT-PCR) analyses as previously described*. All 
siRNA oligonucleotides were synthesized by our in-house facility using the 
sequences from the MISSION shRNA library (Broad Institute, United States) 
and their sequences are listed in Supplementary Table 1. 

Inflammasome activation assays. Activation of the canonical caspase-1 inflam- 
masomes (the NLRP3, NAIP-NLRC4 and AIM2 inflammasomes) and the non- 
canonical caspase-11 inflammasome by LPS was performed using the protocols 
that have been detailed in our previous publications*®°”°. For bacteria-induced 
inflammasome activation, S. typhimurium (wild-type and AsipD), B. thailandensis 


(wild-type and AbipB), EPEC (wild-type and AescN) were used to infect iBMDM 
cells and S. typhimurium (wild-type and AsifA) was used to infect HeLa cells, as 
described previously*”. 

Microscopy imaging of cell death. To examine cell death morphology, cells were 
treated as indicated in the 6-well plates (Nunc products, Thermo Fisher Scientific 
Inc.) for static image capture or in glass-bottom culture dishes (MatTek 
Corporation) for live imaging. Static bright field images of pyroptotic cells were 
captured using an Olympus IX71 or a Zeiss Pascal Confocal microscope. The 
image pictures were processed using ImageJ or the LSM Image Examiner pro- 
gram. Live images of cell death were recorded with the PerkinElmer UltraVIEW 
spinning disk confocal microscopy and processed in the software Volocity. All 
image data shown were representative of at least three randomly selected fields. 
Genome-wide CRISPR-Cas9 screens. The lentiviral gRNA plasmid library for 
genome-wide CRISPR-Cas9 screen was obtained from Addgene (#50947); amp- 
lification of the library and preparation of the lentivirus were performed following 
the protocol provided by Addgene. In brief, 1 il of library DNA (10 ng pl’) was 
used to transform 25 pl of electrocompetent E. coli (TaKaRa). Transformed 
colonies (>6 X 10”) were scraped off the Luria-Bertani (LB) plates into the media, 
and plasmids were exacted by using the GoldHi EndoFree Plasmid Maxi Kit 
(CWBIO). To prepare the virus library, 293T cells in the 15-cm dish were trans- 
fected with 25 1g of library DNA together with 15 j1g of psPAX2 and 10 pg of 
pMD2.G. Eight hours after transfection, the media were changed to high-serum 
DMEM (20% FBS with 25 mM HEPES). Another 40 h later, the media 
(from twenty 15-cm dishes of transfected cells) were collected and centrifuged 
at 3,000 r.p.m. for 10 min. The supernatant was filtered through a 0.22-um 
membrane and aliquots of 30 ml were stored at —80 °C. 

In the pilot experiment, the volume of the lentivirus library required for 
achieving an MOI of 0.3 for infecting the target cell line was determined in the 
12-well plate format. For the large scale screen, Tir4-/~ iBMDM cells stably 
expressing the Cas9 protein were seeded in the 15-cm dish (2 X 10° cells in 
20 ml media per dish) and a total of 2 X 10” cells were infected with the gRNA 
lentivirus library. Sixty hours after infection, cells were re-seeded at a density of 
1X 10°ml7! in fresh media supplemented with 5 1g ml~ ‘puromycin (to elim- 
inate non-infected cells). After 6 to 8 days, ~3 X 10° cells from five culture dishes 
were electroporated with LPS to trigger caspase-11-mediated pyroptosis’, or 
stimulated with LFn-BsaK/protective antigen (PA) to induce caspase-1-mediated 
pyroptosis‘; another 3 X 10° cells were left untreated as the control sample. Each 
screen was repeated another time. Surviving cells were collected after growing to 
near 90% confluence and lysed in the SNET buffer (20 mM Tris-HCL(pH 8.0), 
5 mM EDTA, 400 mM NaCl, 400 pg ml ! Proteinase K and 1% SDS). Genomic 
DNAs of each group of cells were prepared by using the phenol-chloroform 
extraction and isopropanol precipitation method. The DNA was dissolved in 
H20 (4-5 ug ul!) and used as the templates for amplification of the gRNA. 

The gRNAs were amplified by a two-step PCR method using the Titanium Taq 
DNA polymerase (Clontech Laboratories). In the first step, six 50-p1l PCR reac- 
tions (each containing 50 jg of genomic DNA template) were performed with the 
forward primer 50bp-F and the reverse primer 50bp-R; the PCR program used is 
94° C for 180s, 16 cycles of 94° C for 30 s, 60° C for 10s and 72 ° C for 25s, anda 
final 2-min extension at 68°C. Products of the first-step PCR were pooled 
together and used as the template for the second-step PCR. Also six 50-p] PCR 
reactions (each containing 1 Ul of the first-step PCR product) were performed 
with the forward primer Index-F and one of the reverse primers (Index-R1 to R6): 
Index-R1 for the control sample, Index-R2 for the replicate control sample, 
Index-R3 for the caspase-11 screen, Index-R4 for the replicate caspase-11 screen, 
Index-R5 for the caspase-1 screen and Index-R6 for the replicate caspase-1 
screen. The PCR program used is 94°C for 180s, 18 cycles of 94°C for 30s, 
54° C for 10s and 72 ° C for 18 s, anda final 2-min extension at 68 ° C. Products of 
the second-step PCR reactions were subjected to electrophoresis on the 1.5% 
agarose gel; the DNAs (the 310-bp band) were extracted and sequenced at the 
HiSeq2500 instrument (Illumina) by using the 50-bp single-end sequencing pro- 
tocol. The first 19 nucleotides from each sequencing read are the gRNA sequence 
recovered from the library. The frequency of each gRNA was obtained by dividing 
the gRNA read number by the total sample read number; the fold of enrichment 
was calculated by comparing the frequency of each gRNA in the experiment 
sample with that in the control sample. Sequences for all the primers are listed 
in Supplementary Table 1. 

The top 50 gRNA hits from the caspase-11 screen were examined and 18 genes 
that are conserved in human and mouse were identified for siRNA knockdown 
validation in HeLa cells. HeLa cells expressed caspase-4 but not caspase-5 
(Extended Data Fig. 1b) and respond robustly to cytosolic LPS”. For each gene, 
a mixture of two independent siRNAs was used and the knockdown efficiency of 
12 of those having mRNA expression in HeLa cells was confirmed. Importantly, 
only siRNAs targeting human GSDMD, besides the control CASP4-targeting 
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siRNA, could efficiently block cytosolic LPS-induced pyroptosis (Extended Data 
Fig. 1c). When assayed individually, the two GSDMD-targeting siRNAs both 
showed potent inhibition of HeLa cell pyroptosis (Extended Data Fig. 1d). 
Generation of CRISPR-Cas9 knockout cell lines. Human codon-optimized 
Cas9 (hCas9) and GFP-targeting gRNA-expressing plasmids (gRNA_GFP-T1) 
were purchased from Addgene. The 19-bp GFP-targeting sequence in the gRNA 
vector was replaced with the sequence targeting the desired gene by QuickChange 
site-directed mutagenesis. The target sequences used are AGCATCCTGGC 
ATTCCGAG for mouse Gsdmd and TTCCACTTCTACGATGCCA for human 
GSDMD. To construct the knockout cell lines, 1 ug of gRNA-expressing plasmid, 
3 ug of hCas9 plasmid and 1 yg of pEGFP-Cl vector were co-transfected into 
6 X 10° iBMDM or HeLa cells. Three days later, GFP-positive cells were sorted 
into single clones into the 96-well plate by flow cytometry using the BD 
Biosciences FACSAria II or the Beckman Coulter MoFlo XDP cell sorter. 
Single clones were screened by the T7 endonuclease I-cutting assay and the 
candidate knockout clones were verified by sequencing of the PCR fragments 
as described previously’. The PCR primers used are listed in Supplementary 
Table 1. 

Knockout mice and primary BMDM cells. All animal experiments were con- 
ducted following the Ministry of Health national guidelines for housing and care 
of laboratory animals and performed in accordance with institutional regulations 
after review and approval by the Institutional Animal Care and Use Committee at 
National Institute of Biological Sciences. The Gsdmd knockout mice were gener- 
ated by co-microinjection of in vitro-translated Cas9 mRNA and gRNA into the 
C57BL/6 zygotes. Founders with frameshift mutations were screened with T7E1 
assay and validated by DNA sequencing. Founders were intercrossed to generate 
biallelic Gsdmd~/~ mice. The gRNA sequence used to generate the knockout 
mice is AGCATCCTGGCATTCCGAG. C57BL/6 wild-type mice were from 
Vital River Laboratory Animal Technology Co. and Casp1/11~/~ mice were 
obtained from the Jackson Laboratory. Ripk3 ‘~ mice were a gift from X. 
Wang (National Institute of Biological Sciences, Beijing). Primary BMDM cells 
were prepared from 6-week-old male mice (C57BL/6 background) by following a 
standard procedure as previously described®. For each experimental design, at 
least two mice were chosen to prepare the BMDM cells for assaying the inflam- 
masome responses; the mice were not randomized and the investigators were 
not blinded. 

Cytotoxicity assay and IL-1B ELISA. Relevant cells were treated as indicated. 
Cell death was measured by the LDH assay using CytoTox 96 Non-Radioactive 
Cytotoxicity Assay kit (Promega). Cell viability was determined by the CellTiter- 
Glo Luminescent Cell Viability Assay (Promega). To measure IL-1 release, 
primary BMDM cells were primed with LPS (1 1g ml~') for 2h and released 
mature IL-1B was determined by using the IL-1 ELISA kit (Neobioscience 
Technology Company). 

Purification of recombinant proteins. To obtain recombinant human GSDMD, 
E. coli BL21 (DE3) cells harbouring pET28a-Hisg-SUMO-GSDMD were grown 
in LB medium supplemented with 30 pg ml’ kanamycin. Protein expression was 
induced overnight at 18° C with 0.4 mM isopropyl-B-p-thiogalactopyranoside 
(IPTG) after ODgo9 nm reached 0.8. Cells were harvested and resuspended in a 
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lysis buffer containing 20 mM Tris-HCl (pH 8.0), 150 mM NaCl, 20 mM imida- 
zole and 10 mM 2-mercaptoethanol. The Hisg-SUMO-tagged protein was first 
purified by affinity chromatography using Ni-NTA beads (Qiagen) and the 
SUMO tag was removed by overnight ULP1 protease digestion at 4°C. The 
cleaved GSDMD was further purified by HiTrap Q ion-exchange and Superdex 
G200 gel-filtration chromatography (GE Healthcare Life Sciences). 

To obtain the constitutive-active caspase-11 p20/p10 tetramer, cDNAs encod- 

ing the p20 large and p10 small subunit were cloned into pET21a with a 6 X His tag 
fused to the C terminus of the p10 subunit. The two subunits were separately 
expressed in E. coli with 1 mM IPTG induction for 4 h at 30 ° C. Bacteria collected 
from 1-1 culture were resuspended and lysed in 100 ml of lysis buffer (50 mM Tris- 
HCl (pH 8.0), 150 mM NaCl and 10 mM 2-mercaptoethanol) by sonication. 
Inclusion bodies, obtained by centrifugation of the lysates at 18,000 r.p.m. for 
1h, was washed with 50 ml of Buffer 1 (50 mM Tris-HCl (pH 8.0), 300 mM NaCl, 
1 M guanidinium hydrochloride (GdnCl) and 0.1% Triton X-100) and 50 ml of 
Buffer 2 (50 mM Tris-HCl (pH 8.0), 300 mM NaCl and 1 M GdnCl) twice for each 
buffer. The washed inclusion bodies were solubilized by stirring in 6 ml of the 
solubilization buffer containing 6.5 M GdnCl, 25 mM Tris-HCl (pH 7.5), 5 mM 
EDTA and 100 mM DTT overnight at room temperature. To obtain active 
p20/p10 tetramers by refolding, 12 ml of above solubilized inclusion body solu- 
tion containing denatured large and small subunits (molecular ratio, 1:2) were 
drop-by-drop diluted in 500 ml of refolding buffer (100 mM HEPES, 100 mM 
NaCl, 100 mM sodium malonate, 20% sucrose, 0.1 M NDSB-201 and 10 mM 
DTT) and then gently stirred in a nitrogen atmosphere at 16 °C overnight. Protein 
aggregates were removed by centrifugation at 4,000 r.p.m. for 20 min and the 
refolded protein supernatants were concentrated and dialysed against a buffer 
containing 50 mM Tris-HCl (pH 8.0), 150 mM NaCl and 10 mM 2-mercaptoeth- 
anol. The protein was affinity-purified by the Ni-NTA beads and further purified 
by the Superdex G200 gel-filtration chromatography. Expression and purification 
of recombinant LFn—-BsaK and LFn-FliC proteins were described previously’. 
Recombinant full-length caspase-4 and caspase-11 were expressed and purified 
from insect cells also as previously described’. Recombinant PreScission protease 
(PPase) proteins are routine lab stocks. 
In vitro GSDMD cleavage by recombinant caspases. For cleavage by the p20/ 
p10 tetramers of active caspase, 5 tg of purified recombinant GSDMD was 
incubated with 1 unit of caspase-1, 2, 4, 8 and 9 or 0.1 pg of caspase-11 in a 
25-l reaction containing 50 mM HEPES (pH 7.5), 3 mM EDTA, 150 mM NaCl, 
0.005% (vol/vol) Tween-20 and 10 mM DTT. The reaction was incubated for 
60min at 37°C. For cleavage by LPS-activated caspase-4/11, the full-length 
caspase proteins purified from insect cells were first incubated with LPS, lipid 
A or MDP for 30 min at 30° C; 5 pg of purified recombinant GSDMD was then 
reacted with the ligand-incubated caspases at 37°C for 9min. Cleavage of 
GSDMD was examined by Coomassie blue staining of the reaction samples 
separated on the SDS-PAGE gel. 


33. Koike-Yusa, H., Li, Y., Tan, E.-P., del Castillo Velasco-Herrera, M. & Yusa, K. 
Genome-wide recessive genetic screening in mammalian cells with a lentiviral 
CRISPR-guide RNA library.Nature Biotechnol. 32, 267-273 (2014). 
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Extended Data Figure 1 | siRNA knockdown validation of the requirement 
of GSDMD for LPS- and LFn-BsaK-induced pyroptosis. a, LPS 
electroporation-induced pyroptosis in the absence of priming. Primary 
BMDMs (priBMDM) (wild-type (WT) or the Casp1 and Casp11 double 
knockout) or iBMDM cells (wild-type or Tlr4_'~) were assayed. 
Tlr4-'"iBMDMs were used for the CRISPR-Cas9 screen in this study. 

b, Reverse-transcription PCR analyses of caspase-5 expression in HeLa and 
293T cells. Plasmid harbouring caspase-5 cDNA serves as the positive control. 
c, siRNA knockdown validation of the CRISPR-Cas9 screen of LPS-induced 
pyroptosis. HeLa cells were used to validate the selected top hits from the 
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screen. Mixtures of two independent siRNA pairs targeting each gene were 
transfected into the cells. siRNAs targeting CASP4 and luciferase were used as 
the positive and negative control, respectively. d, Effects of GIDMD siRNA 
knockdown on LPS-induced pyroptosis in HeLa cells. e, f, Effects of Gsdmd 
siRNA knockdown on LPS and LFn-BsaK-induced pyroptosis in iB MDM 
cells. The knockdown efficiency (d, e) was measured by qRT-PCR analyses. 
ATP-based cell viability (a, c, d, f) and siRNA knockdown efficiency 

(d, e) were expressed as mean values + s.d. from three technical replicates. Data 
shown are representative of two (c) or three (a, b, d-f) independent 
experiments. 
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Extended Data Figure 2 | The gasdermin family of proteins in human and __ marks the caspase-1/4/11 cleavage motifs in human and mouse GIDMD. The 
mouse. a, Multiple sequence alignment of human GSDMA, GSDMB, GSDMC, __ residue number is indicated on the left of the sequence. b, Phylogenetic tree 
GSDMD and mouse GSDMD. The alignment was performed by using the of all the gasdermin family of proteins in human and mouse. ClustalW 
ClustalW2 algorithm and displayed with ESPript 3.0 (http://espript.ibcp.fr/ alignment was carried out to generate the phylogenetic tree by using the 
ESPript/cgi-bin/ESPript.cgi). Identical residues are highlighted by the dark red ‘Neighbor Joining’ method. DFNA5 and DFNBS59 are distantly related to the 
background and conserved residues are indicated by red font. The black box _ gasdermins, and the latter only contains gasdermin-N domain. 
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Extended Data Figure 3 | Generation of GSDMD-deficient cell lines and d, e, Complementation of GSDMD~'~ HeLa cells and Gsdmd~/~ iBMDM cells 
assays for cytosolic LPS- and LFn-BsaK-triggered pyroptosis. a,Generation _ by stably expressed mouse GIDMD-3 Flag. The accompanying blots show 


of GIDMD '~ HeLa cells and Gsdmd~'~ iBMDM cells by CRISPR-Cas9- the expression of exogenous GSDMD by anti-Flag immunoblotting with 
mediated targeting. Shown are the sequence mutations of the three HeLa anti-tubulin blots serving as the loading control. ATP-based cell viability is 
cell clones and two iBMDM clones used in the study. b, c, f, Effects of expressed as mean values + s.d. from three technical replicates (b-f). Data 
GSDMD ‘~ on LPS electroporation-induced pyroptosis in HeLa (b) and shown are representative of at least three independent experiments. 


iBMDM cells (c) and LFn-BsaK-induced pyroptosis in iBMDM cells (f). 
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a _LFn-BsaK CRISPR-Cas9 screen hits with multiple gRNAs 
Ranking Gene_gRNA # 2 sa Shane gRNA sequence 
1 Nirc4_2 3695.249_ GTTTCGAATAGTCCCCCCC 
2 = Nirc41 2797.553 AGGTGCCTCATGACCGCCC 
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6 Naip2_5 1599.965 TTCTGGTCGTAATTTCTTG 
7 Nirc4_3 1241.855 ATCTGCTCCTCTACACGTG 
8  Casp1_3 1143.169 AGGGCAAGACGTGTACGAG 
10 Casp1_1 877.4373 GAATTCTGGAGCTTCAATC 
12 Fbxo44_1 659.6165 ATCGCCTCCGTTCACGTCC 
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15 Naip2_4 554.2724. ACTGGCCCCACGAATCACC 
17 Naip2_2 315.3426 ACAGCCCCGGGTGATTCGT 
18  Casp1_4 246.0405 CAACTTGAGCTCCAACCCT 
21 Naip2_1 202.6801 |CTAGACTCGTATCTAGGTA 
36 9 Antxr2_4 67.18251 TGACGGACGGTAAGCTGGA 
57  Ccdc6_3 35.80221 CTCGCCGTGAATTACGAGA 
88  Ccdc6_5 23.54041 CTCGCCGTGAATTACGAGA 
138. Gsdmd_3 16.5958 © CAACAGCTTCGGAGTCGTG 
153  1700034J05Rik_4 15.40841 CTCTTCAGGAAACGAGATC 
171 Plekhg6_3 14.6932 | CAGACCATGGCTTATGCGC 
195 Bcl2/2_1 13.87979 CCTAACTGGGGCCGTCTTG 
203. Plekhg6_1 13.46514. ATAAATGGCCAGGTCCGAC 
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239 Antxr2_2 12.47934. GCTAGTGTTTACTGCGTTG 
338 Bcl2/2_5 9.98516 | CCTAACTGGGGCCGTCTTG 
340 1700034J05Rik_2 9.970983 CTGGCCTGGGATATCGATG 
352 Gsdmd_4 9.753432 AGCATCCTGGCATTCCGAG 
353 Gm97_5 9.727808 CTGGCTCACCTATATGCCC 
365 Gm974 9.490875 CTGGCTCACCTATATGCCC 
376 Ppp1r3b_3 9.245637 AGCCCCTGGTTGTCGGCGA 
380 Ppp1r3b_5 9.168484 __ ACAGTTTTCTAGGCAGACG 
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Extended Data Figure 4 | CRISPR-Cas9 screen of LEn-BsaK-triggered 
pyroptosis and effects of Gsdmd knockout on LFn-BsaK-induced and 
caspase-1-mediated caspase-3/7 cleavage. a, gRNA hits from a genome-wide 
CRISPR-Cas9 screen of LFn—BsaK-induced pyroptosis in mouse Tir4 ‘~ 
iBMDM cells. Shown are those genes with multiple gRNA hits. The ranking, the 
average fold increase and the sequences for each gRNA are listed. Genes 
highlighted in red encode known components in the pathway (Antxr2 encodes 
the endocytosis receptor for the LFn tag) and were hit by multiple gRNAs. 

b, siRNA knockdown validation of screen hits. Mixtures of two independent 


siRNA pairs targeting each gene were transfected into the iBMDM cells before 
stimulation with LFn—BsaK. siRNAs targeting Nirc4 and luciferase were used as 
the positive and negative control, respectively. ATP-based cell viability is 
expressed as mean values = s.d. from three technical replicates. c, Caspase-3/7 
activation upon prolonged LFn-BsaK treatment in wild-type and Gsdmd '~ 
(the KO-1 clone; KO, knockout) iBMDM cells. Cell lysates were analysed 

by anti-caspase-3/7 and tubulin immunoblotting. Data shown are 
representative of two (b) and three (c) independent experiments. 
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Extended Data Figure 5 | GSDMD is not required for and not cleaved in 
TNFa-induced necroptosis and apoptosis. a-c, Effects of GIDMD 
knockdown on LPS electroporation-induced pyroptosis and TSZ-induced 
necroptosis in HT-29 cells. Two independent GSDMD-targeting siRNAs (#1 
and #2) were assayed and the immunoblots in a show the knockdown 
efficiency. d, e, Primary BMDM cells from Gsdmd~'~ or other indicated mouse 
strains were stimulated with TSZ (d) or LPS + zVAD (e) to trigger necroptosis. 
f, The absence of GSDMD cleavage in TNFa-induced apoptosis and 
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necroptosis. 2 Flag-HA-GSDMD was stably expressed in HeLa and HT-29 
cells. Apoptosis was induced by TNFa + CHX treatment in HeLa cells and 
necroptosis was induced by TSZ stimulation of HT-29 cells. Lysates of 
stimulated cells were analysed by anti-Flag and anti-tubulin immunoblotting 
to examine possible GSDMD cleavage. GSDMD-FL, full-length GSDMD. 
ATP-based cell viability is expressed as mean values + s.d. from three technical 
replicates (b-f). Data shown are representative of at least two independent 
experiments. 
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Extended Data Figure 6 | Generation of Gsdmd~’~ mice and assays for 
inflammasome-mediated caspase-1 autoprocessing and secretion. 

a, Gsdmd ‘~ mice were generated by CRISPR-Cas9-mdiated targeting. Shown 
are the sequence mutations in the two homozygous F1 lines (F1-1 and F1-2) 
used in the study. b, Anti-caspase-1/caspase-11 immunoblots of lysates of 
unstimulated primary BMDM cells derived from wild-type and Gsdmd'~ 
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mice. c, Primary BMDMs derived from wild-type or Gsdmd~‘~ mice were 
stimulated with indicated canonical inflammasome stimuli or infected with 
S. typhimurium (wild type or the T3SS-deficient AsipD mutant). Total cell 
lysates or the culture supernatants were subjected to anti-caspase-1 or anti- 
tubulin immunoblotting. Data shown (b, c) are representative of two 
independent experiments. 
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Extended Data Figure 7 | Specific cleavage of GSDMD by inflammatory 
caspases. a, Effects of Gsdmd knockout on caspase-1 activation by the AIM2 
inflammasome. Indicated iBMDM cells were stimulated by poly(dA:dT) 
transfection. b, Effects of the pan-caspase inhibitor zVAD on LPS 
electroporation- and LFn—-BsaK-induced GSDMD cleavage in HeLa and 
iBMDM cells, respectively. ATP-based cell viability is expressed as mean values 
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+ s.d. from three technical replicates. c, Assays of GIDMD cleavage by 
inflammatory and apoptotic caspases overexpressed in cells. 3X Flag-GSDMD 
was co-transfected with indicated Myc-caspase into 293T cells. Total cell 
lysates were analysed by anti-caspase-1 (a, b), anti-Flag (b, c), anti-Myc 

(c) and anti-tubulin (a—-c) immunoblotting. Data shown are representative of 
three independent experiments. 
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Extended Data Figure 8 | Resistance of the GSDMD D/A mutant to HA-GSDMD (wild-type or the D/A mutant) iBMDM cells were infected with 
inflammatory-caspase cleavage. a, b, Assays of proteolytic cleavage of the wild-type S. typhimurium (c), B. thailandensis or EPEC (d) to induce caspase-1 
GSDMD D/A mutant by overexpression-activated inflammatory caspases. activation (by the NAIP-NLRC4 inflammasome), or their T3SS-deficient 
3X Flag-tagged mouse (a) or human (b) GSDMD (wild-type or the D/A mutant strains (AsipD, AbipB and AescN, respectively) as controls. Cell lysates 


mutant) was co-transfected with Myc-tagged caspase-1/11 (a) or caspasel/4/5/ _ were analysed by anti-caspase-1, anti-tubulin and anti-Flag immunoblotting. 
11 (b) into 293T cells. Cell lysates were analysed by anti-Flag, anti-Myc and p10, mature caspase-1. GIDMD-FL, full-length GIDMD; GSDMD-N, the 
anti-tubulin immunoblotting. c, d, Assays of proteolytic cleavage of GDMD _N-terminal cleavage product of GIDMD. The D/A mutants refer to D275A 
D/A mutant by bacterial-infection-activated caspase-1. Wild-type, Gsdmd for human GSDMD and D276A for mouse GSDMD. Data shown are 
knockout (the KO-1 clone), or Gsdmd KO-1 complemented with 2 Flag- representative of three independent experiments. 
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Extended Data Figure 9 | Characterization of GSDMA3 and other immunoblots of cell lysates to examine GSDMA3 cleavage and the lower panel 
gasdermin family members. a, Flag-tagged GIDMA, GSDMB, GSDMC and __ shows ATP-based cell viability expressed as mean values + s.d. from three 
GSDMD were co-transfected with caspase-1 (upper panel) or caspase-11 technical replicates. c, The absence of GIDMA3 cleavage in TNFa-induced 
(lower panel) into 293T cells. Cell lysates were analysed by anti-Flag, anti- apoptosis and necroptosis. Flag~-GSDMA3 was expressed in HeLa and 


caspase-1, anti-Myc or anti-tubulin immunoblotting. b, Wild-type GSDMA3 _L929 cells. Apoptosis was induced by TNFa + CHX treatment in HeLa cells 
ora GSDMA3-mutant harbouring a PPase cleavage site between its gasdermin- _and necroptosis was induced by TSZ stimulation of L929 cells. Lysates of 

N and -C domain was expressed in 293T cells. Recombinant PPase was stimulated cells were analysed by anti-Flag and anti-tubulin immunoblotting. 
transfected into the cells by electroporation. The upper panel shows the Data shown are representative of three independent experiments. 
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Caspase-11 cleaves gasdermin D for 
non-canonical inflammasome signalling 


Nobuhiko Kayagaki', Irma B. Stowe!, Bettina L. Lee!, Karen O’Rourke', Keith Anderson”, Seren Warming’, Trinna Cuellar’, 
Benjamin Haley*, Merone Roose-Girma?’, Qui T. Phung’, Peter S. Liu’, Jennie R. Lill’, Hong Li’, Jiansheng Wu’, 

Sarah Kummerfeld*, Juan Zhang”, Wyne P. Lee?, Scott J. Snipas®, Guy S. Salvesen®, Lucy X. Morris’, Linda Fitzgerald’, 
Yafei Zhang’, Edward M. Bertram”’®, Christopher C. Goodnow®?!° & Vishva M. Dixit! 


Intracellular lipopolysaccharide from Gram-negative bacteria including Escherichia coli, Salmonella typhimurium, Shigella 
flexneri, and Burkholderia thailandensis activates mouse caspase-11, causing pyroptotic cell death, interleukin-1B 
processing, and lethal septic shock. How caspase-11 executes these downstream signalling events is largely unknown. 
Here we show that gasdermin D is essential for caspase-11-dependent pyroptosis and interleukin-1f$ maturation. A 
forward genetic screen with ethyl-N-nitrosourea-mutagenized mice links Gsdmd to the intracellular lipopolysaccharide 
response. Macrophages from Gsdmd_'~ mice generated by gene targeting also exhibit defective pyroptosis and interleukin-1p 
secretion induced by cytoplasmic lipopolysaccharide or Gram-negative bacteria. In addition, Gsdmd~ ‘~ mice are protected 
from a lethal dose of lipopolysaccharide. Mechanistically, caspase-11 cleaves gasdermin D, and the resulting amino-terminal 
fragment promotes both pyroptosis and NLRP3-dependent activation of caspase-1 in a cell-intrinsic manner. Our data 
identify gasdermin D as a critical target of caspase-11 and a key mediator of the host response against Gram-negative bacteria. 


Cytoplasmic caspase-11 (also known as caspase-4) defines the non- 
canonical inflammasome that is activated by various Gram-negative 
bacterial infections and causes infected cells to die by pyroptosis’*. 
Caspase-11 also triggers NLRP3- and ASC-dependent activation of 
caspase-1, resulting in the proteolytic maturation of the inactive 
cytokine pro-interleukin (IL)-1B (refs 1, 5-7). Lipopolysaccharide 
(LPS) is the bacterial pathogen-associated molecular pattern that pro- 
motes caspase-11 activation”®. Cytoplasmic LPS can directly bind to 
and activate caspase-11 independently of Toll-like receptor 4 (TLR4)”, 
but precisely how caspase-11 executes pyroptosis and non-canonical 
NLRP3 inflammasome activation has remained unknown. 


Phenomics identifies gasdermin D 


An unbiased forward genetic screen with ethyl-N-nitrosourea (ENU)- 
mutagenized mice’ was performed to unveil essential mediators of non- 
canonical inflammasome signalling in response to cytoplasmic LPS. 
Pedigree IGL1351 had a Mendelian recessive mutation that compro- 
mised LPS-induced IL-1 secretion from peritoneal macrophages. The 
severity of the defect was equivalent to that seen with the Casp11 mutant 
129 strain’ (Fig. 1a and Extended Data Fig. 1). Importantly, IL-1 secre- 
tion by pedigree IGL1351 in response to Clostridium difficile toxin B 
(TcdB), a stimulant of the Mediterranean fever/Pyrin canonical inflam- 
masome™, was comparable to other pedigrees. Exome sequencing iden- 
tified 39 single-nucleotide variants (SNVs) in the ENU-mutated founder 
G1 male. Subsequent genotyping of all identified SNVs in a large cohort 
of siblings and offspring revealed that the trait was completely correlated 
with inheritance of a point mutation in the gene encoding gasdermin D 
(Gsdmd) (Extended Data Fig. 1 and Extended Data Table 1). 

Gsdmd (ref. 12) is a 487 amino acid cytoplasmic protein that contains 
an ill-characterized gasdermin domain and lacks any obvious signal 


peptide or transmembrane segments (Fig. 1b). The human gasdermin 
gene family consists of GIDMA, GSDMB, GSDMC and GSDMD. The 
physiological role of Gsdmd is unknown”, although other gasdermin 
family members have been implicated in deafness or apoptosis in epi- 
thelial cells'*'*. The ENU-derived mutation in Gsdmd caused an Ile to 
Asn amino acid substitution at position 105 with a Polyphen score of 0.98 
(probably damaging) and a SIFT (sorting intolerant from tolerant) score 
of 0.00 (deleterious) (Fig. 1b and Extended Data Table 1). Full-length 
Gsdmd protein (53 kDa) was immunoblotted in unstimulated wild-type 
bone-marrow-derived macrophages (BMDMs). Unlike pro-caspase-11, 
Gsdmd was not induced to any great extent by TLR agonists or inter- 
ferons (Fig. 1c). Wild-type and Gsdmd'!"°" BMDMs expressed 
comparable amounts of Gsdmd (Fig. 1d), suggesting that the IL05N 
mutation impairs Gsdmd function rather than protein expression. 

Wild-type, Casp11~’ ~, or Gsdmd2N BMDMs that were 
primed with Pam3CSK4 and then stimulated with canonical inflam- 
masome stimuli, including ATP to engage the NLRP3-dependent 
inflammasome’, poly(dA:dT) double-stranded DNA (dsDNA) to 
activate the AIM2-dependent inflammasome’*”’, or TcdB"’, secreted 
comparable amounts of IL-1B and exhibited similar levels of pyrop- 
totic death’ (Fig. le). Consistent with these data, Gsdmd NOON 
BMDMs displayed normal induction of NLRP3, pro-IL-1, and 
RANTES cytokine in response to TLR agonists (Fig. 1d, Extended 
Data Fig. 2a). Gsdmd"°'"°" BMDMs also upregulated pro- 
caspase-11 normally (Fig. 1d) but, in contrast to their wild-type coun- 
terparts, were defective in IL-1B secretion, and did not undergo 
pyroptosis in response to cytoplasmic LPS (Fig. le). Proteolytic pro- 
cessing of caspase-1 and pro-IL-1B preceding IL-1 secretion was 
also impaired, suggesting that Gsdmd is critical for all caspase-11- 
dependent signalling events (Extended Data Fig. 2b). 
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Figure 1 | Gsdmd mutation I105N abolishes non-canonical inflammasome 
signalling. a, Screening of third-generation offspring from ENU-treated 
C57BL/6 mice. Graphs indicate IL-1f released from peritoneal macrophages 
cultured for 16h. Circles indicate Casp11 mutant 129 (black), IGL1351-11 
(red), and other G3 offspring (grey). Centre bars represent averages. 

b, Schematic of mouse Gsdmd. Gasdermin domain is grey (Pfam PF04598). 


Gsdmd is essential for caspase-11 signalling 


We confirmed a critical role for Gsdmd in non-canonical inflammasome 
signalling using gene-targeted Gsdmd ’~ mice. In control experiments, 
Pam3CSK4-primed._ wild-type, Gsdmd~’~, and Casp11’ ~ BMDMs 
exhibited equivalent caspase-1-dependent IL-1 secretion and pyrop- 
tosis after a 16 h treatment with TcdB, ATP, dsDNA, or the NLRC4 
inflammasome trigger flagellin'* (Fig. 2a and Extended Data Fig. 3a, b). 
The effect of Gsdmd deficiency on caspase-1-dependent pyroptosis at 
earlier time points is discussed in a later section. Caspase-11 or Gsdmd 
deficiency also had no effect on IL-1B secretion in response to other 
canonical NLRP3 inflammasome stimuli including listeriolysin O (LLO) 
toxin, monosodium urate (MSU), lysomotropic agent HLLOMe, or cal- 
cium pyrophosphate (CPPD)!””-** (Extended Data Fig. 3c, d). In keeping 
with these data, TLR agonists induced NLRP3, pro-caspase-11, and 
RANTES to a similar extent in wild-type and Gsdmd ‘~ BMDMs 
(Extended Data Fig. 3a, e). Therefore, Gsdmd is dispensable for normal 
TLR priming. Gsdmd ‘~ BMDMs were, however, resistant to pyroptosis 
and failed to secrete IL-1 when transfected with E. coli LPS or synthetic 
monophosphory] lipid A, the active moiety of LPS”* (Fig. 2a, Extended 
Data Fig. 3b). Gsdmd~‘~ BMDMs were also unresponsive to electro- 
porated LPS, or LPS transfected with cholera toxin B complex’, or 
S. typhimurium LPS (Extended Data Fig. 3f). Gsdmd, like Casp11, was 
also required for processing of pro-caspase-1 and pro-IL-1f in response 
to cytoplasmic LPS (Fig. 2b). Finally, Gsdmd ‘~ BMDMs further 
resembled Casp11~'~ BMDMs in failing to secrete IL-1B or undergo 
pyroptosis upon infection with Gram-negative bacteria (E. coli, 
Citrobacter rodentium, or Shigella flexneri), whereas both genotypes 


Pam3CSkKé4 priming 


c, d, Immunoblots of wild-type (WT) BMDMs (c) or the genotypes indicated 
(d) after 6 h culture. e, IL-1 and LDH released from BMDMs after 16h. 
Graphs show mean + s.d. of triplicate wells and represent three independent 
experiments. LPS Ctb, LPS plus cholera toxin B complex. For source gels of 
c and d, see Supplementary Fig. 1. 


responded normally to Pseudomonas aeruginosa that activates the 
NLRC4 canonical inflammasome’*””* (Fig. 2c). Collectively, these data 
confirm that macrophages require Gsdmd for pyroptosis and IL-1B 
secretion in response to intracellular LPS. 

Human caspase-4 and caspase-5, the orthologues of mouse cas- 
pase-11, induce pyroptosis in myeloid and endothelial cells in res- 
ponse to intracellular LPS’”*. Like caspase-4/5/11, Gsdmd is specific to 
mammals'***, suggesting a co-emergence of those genes. Human 
GSDMD exhibits 58% amino acid sequence identity with mouse 
Gsdmd. Deletion of GSIDMD or CASP4 by CRISPR/Cas9 in human 
EA.hy926 endothelial cells or human THP-1 monocytes did not affect 
death induced by APO2L”* or dsDNA, but it completely abolished 
intracellular LPS-induced cytotoxicity (Fig. 2d, Extended Data Fig. 
3g-i). Thus, human GSDMD is required for the caspase-4-dependent 
response to cytoplasmic LPS. 


Caspase-11 cleaves and activates Gsdmd 

We next questioned how Gsdmd promotes pyroptosis and caspase-1 
activation by the non-canonical inflammasome. We noticed that 
BMDMs transfected with LPS exhibited caspase-11-dependent cleav- 
age of Gsdmd, giving rise to a 30-kDa N-terminal Gsdmd fragment 
(Fig. 3a). These data identified Gsdmd as a potential substrate of cas- 
pase-11. Recombinant caspase-11 cleaves artificial peptide substrates 
after an aspartate residue’””’, but physiological substrates of caspase-11 
are not well defined. Purified recombinant Gsdmd was cleaved by 
recombinant caspase-11 in vitro into two fragments. Edman sequen- 
cing mapped the cleavage site in Gsdmd to LLSD276| Go77 (Fig. 3b). The 
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Figure 2 | Gsdmd is essential for non-canonical inflammasome signalling. 
a, c, IL-1B and LDH released from BMDMs after 16h. P. aeruginosa infection 
was analysed after 4h. Gsdmd ‘~' and Gsdmd_‘~? are independent knockout 
strains. b, Immunoblots of pooled cell extracts and supernatants from BMDMs 
at 6 h after electroporation with LPS. For source gels, see Supplementary Fig. 1. 


P,-Asp cleavage residue and P,'-Gly residues are conserved across 
species (Extended Data Fig. 4a). Ectopic expression of the 
N-terminal Gsdmd cleavage fragment (mouse residues 1-276) killed 
HEK293T cells, whereas expression of full-length Gsdmd or the 
C-terminal cleavage fragment did not (Fig. 3c). Collectively, these 
data indicate that caspase-11 directly cleaves the 53-kDa inactive 
precursor form of Gsdmd (pro-Gsdmd) to generate the pro- 
pyroptotic N-terminal fragment (Gsdmd p30, Fig. 1b). In full-length 
Gsdmd, the C terminus may mask key sites in the N terminus until 
cleavage at D276| G77 releases this inhibition. Consistent with this 
notion, reconstitution of immortalized Gsdmd'~ macrophages with 
wild-type Gsdmd restored LPS responsiveness, as evidenced by lact- 
ate dehydrogenase (LDH) release, IL-1B secretion and caspase-1 pro- 
cessing, whereas reconstitution with the Gsdmd processing mutant 
D276A did not (Fig. 3d, e). In addition, mutation of Gsdmd within the 
p30 fragment (I105N) attenuated pyroptosis and caspase-1/IL-1B 
processing (Fig. 1b, e and Extended Data Fig. 2b) without affecting 
Gsdmd cleavage (Extended Data Fig. 4b). These data confirm that 
conversion of pro-Gsdmd to its mature p30 form by caspase-11 is 
essential for both pyroptosis and caspase-1 activation in response to 
cytoplasmic LPS. 

Cytoplasmic LPS also caused caspase-4-dependent processing of 
GSDMD in human THP-1 monocytes (Extended Data Fig. 5a). 
Furthermore, transient overexpression of caspase-4 or caspase-5 caused 
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d, Cytotoxicity of APO2L or transfected LPS on control, GSDMD-knockout, or 
CASP4-knockout EA.hy926 cells after 16h. Graphs show mean + s.d. of 
triplicate wells and represent three independent experiments. gRNA, single- 
guide RNA. 


human GSDMD cleavage to the pro-pyroptotic p30 fragment (Extended 
Data Fig. 5b and c). These data together with the conserved P and P,’ 
residues in GSDMD (Extended Data Fig. 4a) indicate a conserved mech- 
anism of GSDMD activation by caspase-4/5/11. 

Cytoplasmic LPS induces caspase-11-dependent pyroptosis and 
secretion of IL-1B, but only the latter requires caspase-1, NLRP3 
and ASC’”. Loss of NLRP3, ASC, or caspase-1 in BMDMs did not 
markedly affect caspase-11-dependent generation of mature Gsdmd 
p30 in response to cytoplasmic LPS (Fig. 4a), indicating that Gsdmd 
cleavage occurs upstream of NLRP3 inflammasome activation. The 
question arises as to whether Gsdmd induces pyroptosis and NLRP3 
inflammasome activation simultaneously in the same cell, or if 
Gsdmd-mediated pyroptosis releases damage-associated molecular 
patterns” that trigger caspase-11-independent NLRP3 inflamma- 
some activation in neighbouring cells (Extended Data Fig. 6a). We 
believe the latter hypothesis is unlikely because LPS-stimulated 
Tl1b-'~ BMDMs exhibited normal pyroptosis, yet their culture super- 
natant failed to induce IL-1 secretion from Casp11~'~ BMDMs 
(Extended Data Fig. 6b, c). In addition, Casp11~' ~ BMDMs co-cul- 
tured with 111b-'” BMDMs did not secrete IL-1 in response to 
cytoplasmic LPS (Fig. 4b and Extended Data Fig. 6d), making it 
unlikely that a paracrine signalling mechanism activates the NLRP3 
inflammasome”’. Collectively, our data demonstrate that Gsdmd mat- 
uration by caspase-11 triggers two distinct cell-intrinsic signals: 
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Figure 3 | Caspase-11 cleaves Gsdmd. a, Immunoblots of BMDM extracts 
and supernatants at 6 h after LPS electroporation. Non-specific bands are 
indicated with asterisks. b, Coomassie blue staining of Gsdmd incubated with 
caspase-11. N terminus of the band marked with an asterisk is shown. 

c, Cytotoxicity of mouse Gsdmd in HEK293Ts. Numbers indicate nanograms 
of plasmid transfected. d, IL-1B and LDH released from reconstituted 


(i) pyroptosis induction; and (ii) NLRP3-dependent caspase-1 
activation (Fig. 4c). 


Role for Gsdmd in canonical pyroptosis 


Human caspase-1 is reported to cleave GSDMD at the same P)-Asp 
residue”! as caspase-11, albeit that the preferred tetrapeptide cleavage 
sequence of caspase-1 differs from that of caspase-117””*. Consistent 
with this study*', the Gsdmd p30 fragment appeared in BMDMs 
treated with canonical inflammasome activators including ATP 
(Fig. 5a and Extended Data Fig. 7a). Although Gsdmd~'~ BMDMs 
stimulated with ATP or flagellin underwent pyroptosis normally after 
8 h, they exhibited less pyroptosis than wild-type BMDMs at earlier 


Gsdmd~'~ immortalized macrophages at 16h after LPS transfection. 

e, Immunoblots of immortalized macrophage extracts and supernatants 

in d. Non-specific band is indicated with an asterisk. Graphs show mean = s.d. 
of triplicate wells and represent three independent experiments. For source gels 
of a, c, and e, see Supplementary Fig. 1. 


time points (Figs 2a and 5c and Extended Data Fig. 7b). These data 
suggest that Gsdmd is also a physiological substrate of caspase-1. 
However, given that caspase-1-dependent pyroptosis is delayed rather 
than prevented, we believe that other ill-defined caspase-1 substrates 
can also mediate pyroptosis (Fig. 5d). This notion is in keeping with 
the fact that caspase-1 orthologues are found in vertebrates including 
non-mammals, whereas caspase-4/5/11 and Gsdmd are exclusive to 
mammals'*”**?, Consistent with IL-1B being a direct substrate of 
caspase-1**, Gsdmd deficiency did not impact caspase-1-dependent 
IL-1 processing in response to ATP (Fig. 5b). However, Gsdmd /~ 
BMDMs secreted less IL-1 than wild-type at the 2-h time points in 
response to ATP and flagellin (Extended Data Fig. 7c). Gsdmd may 
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Figure 4 | Cell-intrinsic NLRP3 inflammasome activation. a, Immunoblots 
of BMDM extracts and supernatants at 6 h after electroporation with LPS. Non- 
specific band is indicated with an asterisk. For source gels, see Supplementary 
Fig. 1. b, IL-1B released from 1:1 mixed BMDM cultures at 16h after 


stimulation. c, Model for non-canonical inflammasome signalling. Graph 
shows mean ~ s.d. of triplicate wells and represents three independent 
experiments. 
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Figure 5 | Role of Gsdmd in canonical inflammasome signalling. 

a, b, Immunoblots of BMDM extracts and supernatants after stimulation with 
ATP for 8 h (a) or as indicated (b). Non-specific band is indicated with an 
asterisk. For source gels, see Supplementary Fig. 2. c, LDH released from 


contribute to IL-1 release by damaging cell membranes or regulating 
the ill-characterized IL-1 secretion mechanism**”. 


Gsdmmd is essential for lethal sepsis 


We investigated the relevance of our in vitro findings in a mouse 
model of acute septic shock wherein caspase-11-dependent pyrop- 
tosis leads to lethal endotoxemia’’*. Mice lacking either Casp11 or 
Gsdmd were resistant to LPS-induced lethal septic shock (Fig. 6 and 
Extended Data Table 2). This result is consistent with Gsdmd being a 
critical pro-pyroptotic substrate of caspase-11 in vivo. 


Discussion 

Pyroptosis plays an important role in anti-bacterial innate immune 
defence and lethal endotoxemia’’’**, but how inflammatory caspases 
1, 4, 5 and 11 cause pyroptosis has remained unknown. Our data reveal 
that proteolytic cleavage of Gsdmd by mouse caspase-11 or human 
caspase-4 is essential for pyroptosis of innate immune cells and endothe- 
lial cells harbouring LPS-tainted cytoplasm. Cleaved Gsdmd also triggers 
NLRP3-dependent activation of caspase-1 through a cell-intrinsic path- 
way (Fig. 4c), although the exact mechanism responsible for NLRP3 


100 ®@® ow 
a ~@- Casp11-- 
S 80 -@ Gsdma“~ 
= 604 
2 
S 40) 
ip) 
20 | 
0 : ; : 1 
0 20 40 #460 80 100 


Hour 


Figure 6 | Gsdmd deficiency protects against lethal sepsis. Kaplan-Meier 
survival plots for mice (n = 8 for each genotype) challenged with 54mgkg * 
LPS. Gsdmd~'~ °'? mice were used (described in Extended Data Fig. 8). 
Statistical analysis and P values (Extended Data Table 2) were adjusted to 
account for multiple comparisons using Bonferroni’s correction. 
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BMDMsafter LPS electroporation or the treatments indicated. Gsdmd ‘~* and 
Gsdmd '~* are independent knockout strains. d, Model for canonical 
inflammasome signalling. Graphs show mean + s.d. of triplicate wells and 
represent three independent experiments. 


activation remains unclear. NLRP3 senses diverse stimuli that perturb 
intracellular homeostasis*””’”’, so its activation may be an indirect con- 
sequence of the effects of the Gsdmd p30 fragment. Future studies will 
need to address exactly what the p30 fragment does in cells. The re- 
defined non-canonical inflammasome signalling pathway (Fig. 4c) 
refutes the previous hypothesis that a caspase-1/11 complex is respons- 
ible for caspase-1 activation’”®. 

The partial contribution of Gsdmd to caspase-1-dependent pyrop- 
tosis suggests the existence of other pro-pyroptotic caspase-1 sub- 
strates (Fig. 5d). Caspase-1 can trigger pyroptosis in the absence of 
Gsdmd, which is not unexpected because non-mammalian verte- 
brates lack Gsdmd yet still exhibit caspase-1-dependent pyrop- 
tosis'‘*’*"?°, The co-emergence of caspase-4/5/11 and Gsdmd genes 
in the early stages of mammalian evolution might indicate co-evolu- 
tion, perhaps in response to the need for additional anti-pathogen 
responses against Gram-negative bacteria. 

Like other caspases, caspase-11 cleaves multiple substrates 
However, our in vitro and in vivo data show that caspase-11 relies 
exclusively on Gsdmd to promote pyroptosis, caspase-1 activation 
and LPS-induced lethal sepsis. Thus Gsdmd is revealed as an 
unexpected, but critical facet of the anti-bacterial response. 


28,31,40 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 


Received 14 July; accepted 4 September 2015. 


Published online 16 September 2015. 
1. Kayagaki, N. et a/. Non-canonical inflammasome activation targets caspase-11. 
Nature 479, 117-121 (2011). 

Broz, P. et al. Caspase-11 increases susceptibility to Salmonella infection in the 
absence of caspase-1. Nature 490, 288-291 (2012). 

Rathinam, V. A. et al. TRIF licenses caspase-11-dependent NLRP3 inflammasome 
activation by gram-negative bacteria. Cel! 150, 606-619 (2012). 

Aachoui, Y. et al. Caspase-11 protects against bacteria that escape the vacuole. 
Science 339, 975-978 (2013). 

Mariathasan, S. et al. Differential activation of the inflammasome by caspase-1 
adaptors ASC and Ipaf. Nature 430, 213-218 (2004). 

Mariathasan, S. et a/. Cryopyrin activates the inflammasome in response to toxins 
and ATP. Nature 440, 228-232 (2006). 


2° 


al 


©2015 Macmillan Publishers Limited. All rights reserved 


Kayagaki, N. et a/. Noncanonical inflammasome activation by intracellular LPS 
independent of TLR4. Science 341, 1246-1249 (2013). 

Hagar, J. A., Powell, D. A, Aachoui, Y., Ernst, R. K. & Miao, E. A. Cytoplasmic LPS 
activates caspase-11: implications in TLR4-independent endotoxic shock. Science 
341, 1250-1253 (2013). 

Shi, J. et al. Inflammatory caspases are innate immune receptors for intracellular 
LPS. Nature 514, 187-192 (2014). 


. Nelms, K. A. & Goodnow, C. C. Genome-wide ENU mutagenesis to reveal immune 


regulators. Immunity 15, 409-418 (2001). 


. Xu, H. et al. Innate immune sensing of bacterial modifications of Rho GTPases by 


the Pyrin inflammasome. Nature 513, 237-241 (2014). 


. Katoh, M. & Katoh, M. Identification and characterization of human DFNAS5L, 


mouse Dfna5l, and rat Dfna5! genes in silico. Int. J. Oncol. 25, 765-770 (2004). 


. Fujii, T. etal. Gasdermin D (Gsdmd) is dispensable for mouse intestinal epithelium 


development. Genesis 46, 418-423 (2008). 


. Tamura, M. etal. Members of a novel gene family, Gsdm, are expressed exclusively 


in the epithelium of the skin and gastrointestinal tract in a highly tissue-specific 
manner. Genomics 89, 618-629 (2007). 


. Roberts, T. L. et al. HIN-200 proteins regulate caspase activation in response to 


foreign cytoplasmic DNA. Science 323, 1057-1060 (2009). 


. Fernandes-Alnemri, T. et al. The AIM2 inflammasome is critical for innate 


immunity to Francisella tularensis. Nature Immunol. 11, 385-393 (2010). 


. Hornung, V. et a/. AIM2 recognizes cytosolic dsDNA and forms a caspase-1- 


activating inflammasome with ASC. Nature 458, 514-518 (2009). 


. Franchi, L. et al. Cytosolic flagellin requires Ipaf for activation of caspase-1 and 


interleukin 1B in salmonella-infected macrophages. Nature Immunol. 7, 576-582 
(2006). 


. Meixenberger, K. et al. Listeria monocytogenes-infected human peripheral blood 


mononuclear cells produce IL-1, depending on listeriolysin O and NLRP3. 
J. Immunol. 184, 922-930 (2010). 


. Schroder, K. & Tschopp, J. The inflammasomes. Cel! 140, 821-832 (2010). 
. Martinon, F., Petrilli, V., Mayor, A, Tardivel, A. & Tschopp, J. Gout-associated 


uric acid crystals activate the NALP3 inflammasome. Nature 440, 237-241 
(2006). 


. Hornung, V. et al. Silica crystals and aluminum salts activate the NALP3 


inflammasome through phagosomal destabilization. Nature /mmunol. 9, 847-856 
(2008). 


. Sutterwala, F. S. et a/. Immune recognition of Pseudomonas aeruginosa mediated 


by the IPAF/NLRC4 inflammasome. J. Exp. Med. 204, 3235-3245 (2007). 


. Casson, C. N. et al. Human caspase-4 mediates noncanonical inflammasome 


activation against gram-negative bacterial pathogens. Proc. Nat! Acad. Sci. USA 
112, 6688-6693 (2015). 


. Sakamaki, K. & Satou, Y. Caspases: evolutionary aspects of their functions in 


vertebrates. J. Fish Biol. 74, 727-753 (2009). 


. Ashkenazi, A. & Dixit, V. M. Death receptors: signaling and modulation. Science 


281, 1305-1308 (1998). 


. Thornberry, N. A. etal. A combinatorial approach defines specificities of members 


of the caspase family and granzyme B. Functional relationships established for key 
mediators of apoptosis. J. Biol. Chem. 272, 17907-17911 (1997). 


ARTICLE 


28. Kang, S. J. et a/. Dual role of caspase-11 in mediating activation of caspase-1 
and caspase-3 under pathological conditions. J. Cell Biol. 149, 613-622 
(2000). 

29. Kaczmarek, A., Vandenabeele, P. & Krysko, D. V. Necroptosis: the release of 
damage-associated molecular patterns and its physiological relevance. /mmunity 
38, 209-223 (2013). 

30. Ruhl, S. & Broz, P. Caspase-11 activates a canonical NLRP3 inflammasome by 
promoting K* efflux. Eur. J. Immunol. http://dx.doi.org/10.1002/eji.201545772 
(2015). 

31. Agard, N. J., Maltby, D. & Wells, J. A. Inflammatory stimuli regulate caspase 
substrate profiles. Mol. Cell. Proteomics 9, 880-893 (2010). 

32. Angosto, D. et al. Evolution of inflammasome functions in vertebrates: 
Inflammasome and caspase-1 trigger fish macrophage cell death but are 
dispensable for the processing of IL-1. /nnate /mmun. 18, 815-824 (2012). 

33. Thornberry, N. A. et a/. A novel heterodimeric cysteine protease is required for 
interleukin-1 beta processing in monocytes. Nature 356, 768-774 (1992). 

34. Eder, C. Mechanisms of interleukin-1f release. /mmunobiology 214, 543-553 
(2009). 

35. Liu, T. et al. Single-cell imaging of caspase-1 dynamics reveals an all-or-none 
inflammasome signaling response. Cell Rep. 8, 974-982 (2014). 

36. Wang, S. eta/. Murine caspase-11, an ICE-interacting protease, is essential for the 
activation of ICE. Cell 92, 501-509 (1998). 

37. Miao, E. A. et al. Caspase-1-induced pyroptosis is an innate immune effector 
mechanism against intracellular bacteria. Nature Immunol. 11, 1136-1142 
(2010). 

38. Bergsbaken, T., Fink, S. L. & Cookson, B. T. Pyroptosis: host cell death and 
inflammation. Nature Rev. Microbiol. 7, 99-109 (2009). 

39. Xie, H. X. et al. Edwardsiella tarda-induced cytotoxicity depends on its type III 
secretion system and flagellin. Infect. Immun. 82, 3436-3445 (2014). 

40. Py, B. F. et al. Caspase-11 controls interleukin-18 release through degradation of 
TRPC1. Cell Rep. 6, 1122-1128 (2014). 


Supplementary Information is available in the online version of the paper. 


Acknowledgements We thank the staff of the Australian Phenomics Facility, 
Genentech Transgenic Technology and FACS cores, and K. Bowman, J. Payandeh, 

E. Dueber, R. Aglietti, A: Gupta and A. Peterson for technical expertise and discussion, 
K. Newton for manuscript editing, A. Muszynski, L. S. Forsberg, and R. W. Carlson for 
S. typhimurium LPS. Most authors were employees of Genentech, Inc. 


Author Contributions N.K.,.B.S.,B.LL, K.0., T.C.,B.H., P.S.L,, Q.T.P., J.R.L, H.L., J.W., S.K., 
J.Z., W.P.L, S.J.S., L.X.M., L.F., Y.Z.and E.M.B. designed and performed experiments. K.A. 
and S.W. generated Gsdmd~’~ mice. S.W. and M.R.-G. generated Casp1~/~ mice. N.K. 
and E.M.B. prepared the manuscript. N.K., G.S.S., E.M.B., C.C.G. and V.M.D. contributed 
to the study design and data analyses. 


Author Information Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to N.K. (kayagaki@gene.com) or V.M.D. 
(dixit@gene.com). 


29 OCTOBER 2015 | VOL 526 | NATURE | 671 


©2015 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


METHODS 

ENU-mutagenized mouse strains. Third-generation (G3) offspring used for the 
phenotypic screen were from ENU-treated C57BL/6 mice as described"®. 129X1/ 
SvJ strain was used as a Casp11 mutant' control. All animals were housed under 
specific-pathogen-free conditions at the Australian Phenomics Facility. Animal 
experiments were approved by the Australian National University Animal Ethics 
and Experimentation Committee. 

Exome capture and sequencing of G1 founder. Exome-enriched, paired-end 
libraries were prepared from genomic DNA of first generation (G1) mice using 
the Agilent SureSelect XT2 Mouse All Exon kit (Agilent) following the manufac- 
turer’s instructions. Each sample was prepared with an index and then pooled in a 
batch of six in equimolar amounts before exome enrichment. Each 6-plex exome- 
enriched library was sequenced in one lane of an Illumina HiSeq2000 with version 2 
chemistry as 100-bp paired-end reads. ENU variants were identified as described 
previously”. In brief, sequence reads were mapped to the GRCm38 assembly of the 
reference mouse genome using the default parameters of the Burrows-Wheeler 
Aligner (BWA, http://bio-bwa.sourceforge.net). Untrimmed reads were aligned 
allowing a maximum of two sequence mismatches and were discarded where they 
aligned to the genome more than once. Sequence variants were identified with 
SAMtools (http://samtools.sourceforge.net) and annotated using Annovar (http:// 
www.openbioinformatics.org). PolyPhen2 (http://genetics.bwh.harvard.edu/pph2) 
and SIFT (http://siftjcviorg/) were used for the calculation of variant effect. 
Variants were filtered to prioritize novel variants not in dbsnp (or in a list of common 
mouse variants identified by the pipeline) and predicted to be deleterious to the 
protein by PolyPhen and/or SIFT. For variant validation, G1 samples were genotyped 
for the ENU variants identified by the exome sequencing using a competitive, allele- 
specific dual FRET-based assay, KASP (LGC). For genotyping of G3 offspring and 
IGL1351 pedigree, primers were designed based on the SNV locus-flanking sequence. 
Plates were read in a FLUOStar Optima (BMG Labtechnologies) plate reader. 
Other mice. Nirp3 ’~, Asc “~, and Casp11-‘~ mice on a C57BL/6N background 
were described previously’. I11b ’~ mice generated from AB2.2 ES cells (129S7) were 
obtained from Taconic and backcrossed to C57BL/6J up to the N10 generation. 
C57BL/6-type Casp11 genotype was confirmed by PCR as described'. Gsdind-/~ 
mice were obtained by cytoplasmic injection of C57BL/6N zygotes with 25 ng pl 
wild-type Cas9 mRNA (Life Technologies) and 13ngpl~' in vitro-transcribed 
single-guide RNA (gRNA) prepared by MEGAshortscript T7 Kit (Life Tech- 
nologies). Tail DNA from resulting offspring was analysed by PCR and sequencing. 
Target sequences of gRNA are 5'-GAGAAGGGAAAATTTCTGG (G9) and 5’- 
AGGG 

CAGAGTGATGTTGTC (G2). Allele sequences of Gsdmd ‘~ mice used are pro- 
vided in Extended Data Fig. 8. Casp1”'~ mice (Extended Data Fig. 7d) lacking exons 
1 and 2 were generated at Genentech from gene-targeted C57BL/6N C2 ES cells. 
Casp1~'~ mice were genotyped with PCR primers (5'-CCTGAATCTTAGACCAA 
GTTGAG; 5'-AGGCAGAAGGAATAGGAATAGT and 5'-CCAGGTATCTAAT 
CACAATGGT) yielding a 167-bp wild-type DNA fragment and a 287-bp mutant 
DNA fragment. The Genentech Institutional Animal Care and Use Committee 
approved all animal studies. 

Reagents and antibodies. Ultra-pure LPS (E. coli O111:B4), Pam3CSK4, 
poly(I:C) LMW, R837 (Imiquimod), dsDNA (poly(dA:dT)), ultra-pure flagellin 
(P. aeruginosa), MSU, and CPPD were from Invivogen. Other reagents were 
synthetic monophosphoryl lipid A (Enzo Life Sciences), TcdB, cholera toxin B 
(Ctb; List Biological Laboratories), LLO toxin (ReagentProteins), HLLOMe 
(Chem Impex International), IFN-«, (PBL Assay Science), IFN-y (eBioscience), 
and ATP (Sigma). LPS from S. typhimurium was described previously?”. 
Antibodies used include: mouse caspase-1 (clone 4B4, Genentech), human cas- 
pase-1 (Bally-1, AdipoGen), caspase-4 (4B9, Enzo Life Sciences), caspase-5 (4429, 
Cell Signaling Technology), caspase-11 (clone 17D9, Novus Biologicals), Gsdmd 
(G7422 anti-Gsdmd aa 126-138, Sigma), IL-1 (GTX74034, GeneTex), NLRP3 
(clone Cryo-2, AdipoGen), and Flag epitope (M2, Sigma). 

Macrophage cultures. Bone marrow cells were differentiated in DMEM supple- 
mented with 10% endotoxin-free fetal bovine serum (Omega Scientific) and 20% 
M-CSF-conditioned medium for 5-6 days. Adherent BMDMs or thioglycollate- 
elicited peritoneal macrophages were cultured overnight in 96-well plates at 1 X 
10° cells per ml before being primed for 5-6 h with 1 pg ml” ' Pam3CSK4 in OPTI- 
MEM (Life Technologies). Primed cells were transfected with 2 gml~’ LPS or 
5 ugml * lipid A by using 0.25% v/v FuGENE HD (Promega) or 20 1g ml * Ctb, 
or electroporated with 2 jg ml” ' LPS by using the 4D-Nucleofector system (Lonza) 
as described previously’. Unless specified, FUGENE HD was used for the LPS 
transfection. The other conditions for inflammasome activations were 5mM 
ATP, 100 ng ml! TcdB, 2 Lg ml! dsDNA plus 0.1% v/v Lipofectamine 2000 
(Life Technologies), 0.5 pg ml flagellin plus 0.25% v/v FuGENE HD, 1 mg ml? 
MSU, 500 pgml~' CPPD, and 1 mM HLLOMe. Infections with P. aeruginosa 
(ATCC 27853; multiplicity of infection (MOI) 25), E. coli (ATCC 11775, MOI 


30), C. rodentium (ATCC 51116, MOI 20), or S. flexneri (ATCC 9199, MOI 20) 
were for 1.5h and then cultures were supplemented with 100 1g ml” ' Gentamycin 
(Life Technologies). For stimulation with TLR agonists or interferons, BMDMs 
were cultured with 1 pg ml! Pam3CSK4, 5 ug ml! polyL:C, 5 pg ml! LPS, 
2 ug ml’ R848, 75 ng ml" IFN-o, or 500 ng ml_' IFN-y for the indicated periods. 
ELISAs were used to measure IL-1B (Meso Scale Discovery) in culture superna- 
tants. A CytoTox 96 Non-Radioactive Cytotoxicity assay (Promega) measured cell 
death. For measuring RANTES, Luminex assay (Bio-Rad) was used. Each figure 
legend shows the number of samples per experiment and number of experiments 
that were analysed independently. For immunoblotting, cells were lysed with RIPA 
buffer (50mM _ Tris-HCl pH 7.4, 150mM NaCl, 1mM EDTA, 1X Complete 
Protease Inhibitor (Roche), 1% Triton X-100, 0.1% SDS). For cleavage studies 
(Gsdmd p30, caspase-1 p20, and IL-1 p17), pooled culture supernatants and cell 
extracts were precipitated with 7.2% trichloroacetic acid plus 0.15% sodium deoxy- 
cholate. 

Recombinant proteins. DNA encoding Flag-tagged caspase-11 (residues 2-373) 
and His6-MBP-tagged murine Gsdmd (residues 2-487) were generated by gene 
synthesis, and subcloned into a modified version of pAcgp67 (BD Biosciences). 
Transfer vectors were co-transfected with BestBac linearized viral DNA 
(Expression Systems) into Sf9 cells (Life Technologies) using Transit-IT-Insect 
transfection reagent (Mirus Bio) to produce recombinant baculovirus. The virus 
was amplified twice to prepare the stock used for protein expression. Protein was 
expressed in Tni pro cells (Expression Systems). A 22 1 Wave bioreactor was 
inoculated with Tni pro cells at 1 X 10° cells per ml in serum-free ESF921 
(Expression Systems). At 48h post infection, the Tni cells were harvested and 
frozen at — 80 °C. For the purification of Gsdmd, the baculovirus cell pellets were 
thawed and lysed in lysis buffer (50 mM Tris-HCl pH 8.0, 150 mM NaCl, 20 mM 
imidazole, 1 mM TCEP plus Protease Inhibitor Cocktail (Roche)). The cell lysate 
was loaded onto a3 ml NiNTA column (QIAGEN) pre-equilibrated with buffer A 
(20mM Tris pH 8.0, 300 mM NaCl, 20mM imidazole, 0.5mM TCEP). The 
column was washed with 20 column volumes buffer A, then eluted with 5 column 
volumes of buffer B (20 mM Tris-HCl pH 8.0, 300 mM NaCl, 250 mM imidazole, 
0.5mM TCEP). The elutant was then loaded onto a 5ml MBP column (GE 
Healthcare Life Sciences) pre-equilibrated with buffer C (20mM Tris-HCl pH 
7.5, 150mM NaCl, 10% glycerol, 0.5 mM TCEP). The MBP column was washed 
with 15 column volumes of buffer C. Gsdmd protein was eluted with buffer C 
containing 10mM maltose. To cleave off His-MBP tag, purified His6-MBP- 
tagged Gsdmd was digested with TEV (Sigma). Un-cleaved Gsdmd and TEV 
enzyme were removed by passing through a 3 ml NiNTA column. For the puri- 
fication of caspase-11, the baculovirus cell pellets were thawed and lysed in lysis 
buffer (50mM Tris-HCl pH 7.5, 150 mM NaCl, 0.5mM TCEP plus protease 
inhibitor cocktail) for 1h. The cell lysate was loaded onto a 4ml Flag column 
(Sigma) pre-equilibrated with buffer D (50 mM Tris-HCl pH 7.5, 150 mM NaCl, 
10% glycerol, 0.5 mM TCEP). The column was washed with 5 column volumes of 
buffer C, then 20 column volumes of buffer E (buffer D plus 0.1% TritonX-114) 
followed with 15 column volumes of Buffer D. Caspase-11 was eluted with buffer 
F (buffer D plus 0.2 mg ml! 3XFlag peptide (Sigma)). The purified recombinant 
caspase-11 was then concentrated and further purified by a HiLoad 16/60 
Superdex 200 column with buffer G (20mM Tris-HCl pH 7.5, 150mM NaCl, 
10% glycerol). Purified recombinant caspase-11 and Gsdmd were characterized 
by SDS-PAGE and mass spectrometry, and stored at —20 °C. 

Recombinant Gsdmd processing assay. Recombinant Gsdmd (4 1.M) was incu- 
bated in the presence or absence of 21M recombinant caspase-11 at 37 °C for 
30 min in standard caspase reaction buffer (20 mM PIPES pH 7.2, 10% sucrose, 
100 mM NaCl, 0.1% CHAPS, 1mM EDTA, 10mM DTT). Samples were then 
subjected to SDS-page followed by Coomassie blue staining (Expedeon). For 
Edman degradation analysis, samples were blotted onto a PVDF after SDS-page 
electrophoresis, and the band corresponding to the Gsdmd protein fragment was 
subjected to N-terminal sequencing using traditional Edman chemistries“* on a 
494HT Procise sequencer (Applied Biosystems) with fast cycles*. 

Plasmids and transient expression. cDNAs encoding Gsdmd (human and 
mouse), human caspase-4, and caspase-5 were artificially synthesized with a Flag 
epitope and subcloned into pcDNA3.1/Zeo(+) (Life Technologies) for transient 
expression in HEK293T cells, or into pLenti6.3/V5-DEST (Life Technologies) for 
lentivirus production. Human codon-optimized Cas9 was synthesized and sub- 
cloned into pLenti7.3 (Life Technologies). Mutagenesis of cDNAs was performed 
with a QuikChange site-directed mutagenesis kit (Agilent Technologies). For tran- 
sient expression, HEK293T cells (ATCC) were cultured overnight in 96-well plates at 
1.2 X 10° cells per ml, then transfected with 2-10 ng of plasmids by using 0.16 pl 
Lipofectamine 2000. At 24h after transfection, cytotoxicity against HEK293T cells 
was measured by CellTiter-Glo (Promega). For human GSDMD processing, 
HEK293T cells that stably express human GSDMD were selected using 1 mgml * 
Zeocin (Invivogen) after transfection with human GSDMD plasmid. The stable 
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transfectant cells were transiently transfected with 20 ng of human caspase-4 or -5 
plasmids as described above. At 24h after transfection, cells were lysed with RIPA 
buffer for immunoblotting. HEK293T cell lines used were regularly tested for myco- 
plasma. All cell lines used were authenticated by short tandem repeat (STR) profiling, 
single nucleotide polymorphism (SNP) fingerprinting, and mycoplasma testing. 
Knockout cell lines. Human EA-hy926 (ATCC) endothelial cells and human 
THP-1 (ATCC) monocytes were lentivirally transduced with Cas9 and single-cell 
cloned. gRNAs were transduced into Cas9* cloned lines by lentiviral delivery with 
pLKO.1 vector (Sigma) followed by selection of gRNA-expressing cells by Puromycin 
(Life Technologies). At 11-14 days post-selection, cells were harvested for experi- 
ments. The target sequences of gRNA used are 5'-GITGGCTTCCTGGGCTGCTA 
for human CD81 (control), 5’-GCATGCGAGAATCTCACGC for luciferase (con- 
trol), 5'-GGTGTTTTGGATAACTTGG for CASP4 knockout, 5'-GTAGTCCGGA 
GAGTGGTCC for human GSDMD knockout (gRNA1), and 5’-AACCACCAG 
GCAGTAGGGC for human GSDMD knockout (gRNA2). EA.hy926 cells were 
transfected with 10 pgml~’ LPS by 0.5% v/v Lipofectamine 2000 or treated with 
100 ngml~' APO2L (R&D systems). At 24h after stimulation, cytotoxicity against 
EA hy926 was measured by CellTiter-Glo. THP- 1 cells were primed with 0.5 pg ml! 
Pam3CSK4 for 4h, then plated in 96-well plates at 1 X 10° cells per ml with OPTI- 
MEM containing 0.5 igml~’ Pam3CSK4, followed by transfection of 2 4gml~* 
dsDNA with 0.125% v/v Lipofectamine 2000. For LPS electroporation into 
Pam3CSK4-primed THP-1 cells, the Neon transfection system (Life Technologies) 
was used with 1 X 10° cells plus LPS 0.1 ig per chamber condition. All the cell lines 
used were regularly tested for mycoplasma. All cell lines used were authenticated by 
STR profiling, SNP fingerprinting, and mycoplasma testing. 

Gsdmd reconstitution. Macrophage progenitor cells from Gsdmd ’~° mice 
(Extended Data Fig. 8) were immortalized by ER-Hoxb8 as described previously***”. 
The immortalized macrophage progenitor cells were lentivirally transduced with 
cDNA encoding either wild-type or D276A mouse Gsdmd using pLenti6.3/V5- 
DEST vector. Cells were selected in 6.25ygml ‘ Blasticidin (Invitrogen) and 
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differentiated in M-CSF-containing conditioned media. Stimulation of 
ER-Hoxb8-immortalized macrophages was performed as described above. All 
Hoxb cell lines used were regularly tested for mycoplasma. 

Endotoxic shock model. Female mice aged 8-10 weeks were injected intraperito- 
neally with 54 mg kg” * LPS (E. coli 0111:B4, Sigma) and monitored eight times daily 
for a total of 6 days. Gsdmd~/~ mice used (Gsdmd~/~ 13) are described in Extended 
Data Fig. 8. Statistical analysis was performed with log-rank (Mantel-Cox) tests using 
Prism, and P values were adjusted to account for multiple comparisons using 
Bonferroni’s correction. Sample sizes were chosen by standard methods to ensure 
adequate power and no mice were excluded from the analysis. No randomization or 
blinding was used for the animal studies. No statistical method was used to estimate 
sample size. The Genentech Institutional Animal Care and Use Committee approved 
all animal studies. 
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Extended Data Figure 1 | Homozygosity for a Gsdmd point mutation 


genotypes mutated in the pedigree and their phenotypes (bottom). Hits 


represent mice whose peritoneal macrophages showed attenuated LPS-induced 
IL-1 secretion to a similar extent as Casp11 mutant 129 strain. Mutant, 


homozygous. 


correlates with unresponsiveness to cytoplasmic LPS. Gsdmd genotypes and 
screen phenotypes of mice derived from IGL1351 pedigree (top). Only the 


identification numbers of screened animals are shown. Thirty-nine SNV 
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Extended Data Figure 2 | Gsdmd"°“" BMDMs respond normally to 
TLR agonists. a, RANTES production from BMDMs cultured for 16h with 
medium alone (cont) or the TLR stimulants indicated. b, Western blots of 


BMDM extracts and supernatants at 6h after LPS electroporation. Graph 
shows the mean + s.d. of triplicate wells and represents three independent 
experiments. For source gel, see Supplementary Fig. 2. 
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Extended Data Figure 3 | Non-canonical inflammasome signalling requires _g, i, Western blots of EA-hy926 (g) or THP-1 cells (i). h, LDH released from 
Gsdmd. a, Western blots of BMDMs cultured with or without Pam3CSK4 control (parental or luciferase gRNA), GSDMD knockout (KO), or CASP4 
for 6h. Gsdmd ‘~' and Gsdmd ‘~? are independent knockout strains. b-d, knockout THP-1 cells at 16 h after LPS electroporation or dsDNA transfection. 
IL-1 and LDH released from BMDMs after 16h. P. aeruginosa infection was | Graphs show mean = s.d. of triplicate wells and represent three independent 
analysed at 4h. e, RANTES production from BMDMs after 16h. f, IL-1B experiments. For source gels of a, g and i, see Supplementary 

and LDH released from BMDMs at 16h after LPS electroporation, LPS plus Information Fig. 2. 

cholera toxin B (Ctb) complex, or S. typhimurium LPS transfection. 
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Extended Data Figure 4 | Gsdmd cleavage site. a, Alignment of Gsdmd 
amino acid sequences. The conserved Gsdmd cleavage site (D276 in mouse) is 
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boxed in red. b, Immunoblots of BMDM extracts and supernatants at 16 h after 
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LPS/Ctb treatment. A non-specific band is indicated with an asterisk. Cont, Ctb 
alone. For source gel, see Supplementary Fig. 3. 
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Extended Data Figure 5 | Human GSDMD processing. a, Western blots of 
THP-1 extracts and supernatants at 3h after LPS electroporation. b, 

Western blots of human GSDMD/HEK293T stable transfectants at 24h after 
transient transfection of the indicated plasmids. A non-specific band is 
indicated with an asterisk. c, Cytotoxicity of the human proteins indicated at 
24h after transient transfection of HEK293T cells. Numbers indicate 


nanograms of plasmid transfected. Western blots (right) indicate protein 
expression. Expression of the N-terminal fragment (1-275) was below 
detection levels, presumably due to its potent toxicity. Graph shows mean + 
s.d. of triplicate wells and represents three independent experiments. For source 
gels of a, b and c, see Supplementary Fig. 3. 
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Extended Data Figure 6 | BMDMs stimulated with cytoplasmic LPS do not 
release NLRP3-stimulating damage-associated molecular pattern activity. 
a, Paracrine signalling hypothesis. DAMP, damage-associated molecular 
pattern. b, LDH released from BMDMs at 16h after LPS transfection 

or electroporation. c, IL-1 released from Casp11 ~/~ BMDMsat 16h after 
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stimulation with ATP or incubation with I/1b-‘~ BMDM culture supernatants 
derived in b. d, LDH released from 1:1 mixed cultures of the indicated BMDMs 
at 16h after stimulation. Graphs show mean + s.d. of triplicate wells and 
represent three independent experiments. 


©2015 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


a 


Pam3CSK4-priming 


d 


wt Casp1~— Pam3CSK4: — rs 
a Vv 
a OM Jw Sé 
2 fea 3 2 fea 3 voy hy x 3 x 
Eon eno Rod RS 
SSE BB SS5E RRS yf SPY PS 
we wareeD| | SO =e ewew = [pro-Casp! 
1 Sony i Gsdmd 
St ee es ‘ai = ces ti = pro-Casp11 
actin 
ee | actin — 
ext+sup F 
Cc —aveerenrens actin 
-O- wt 
-@- Caspt~ -@ Gsdmd’-$ 
~@ Casp11~ -@ Gsdma** 
8,000 6,000 16,000 
= 6,000 12,000 
S 4,000 
a 
eo. 4,000 8,000 
% 2,000 
= 2,000 4,000 
0 = rec ©. 


Oh 2h 4h 8h 16h 
LPS 


Oh 2h 4h 8h 16h 


ATP 


Oh 


ST 
2h 4h 8h 16h 
flagellin 


Pam3CSK4-priming 


Extended Data Figure 7 | Canonical inflammasome stimuli induce caspase- 
1-dependent processing of Gsdmd. a, Western blots of BMDM extracts and 
supernatants at 8h after stimulation. Cont, medium alone. LPS, LPS + Ctb. 

Asterisk indicates a non-specific band. b, Western blots of Gidmd~/~ BMDMs. 
Gsdmd~'~* and Gsdmd ~~“ are independent knockout strains (Extended Data 


Fig. 8). c, IL-1 released from BMDMs. LPS, LPS electroporation. Graphs show 
mean + s.d. of triplicate wells and represent three independent experiments. 
d, Immunoblots of Casp1 ~~ BMDMs stimulated with Pam3CSK4 for 5 h. For 
source gels of a, b and d, see Supplementary Fig. 3. 
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Wt gRNA Gg target region 
AGTGATGTTGTCAGGCATGGGAGAAGGGAAAATTTCTGGTGGGGCTGCAGTGTCTGAC 
. ValMetSerLeuGlyMetGlyGluGlyLysI leSerGlyGlyAlaAlaValSerAsp.. 


Gsdmd*'  (1bpTde1/1bpGins) 
allele1: AGTGATGTTGTCAGGCATGGGAGAAGGGAAAATTTC—GGTGGGGCTGCAGTGTCTGAC 
allele2: AGTGATGTTGTCAGGCATGGGAGAAGGGAAAATTTCGTGGTGGGGCTGCAGTGTCTGAC 


Gsdmd*-? (1bpTde1/1632bpdel) 
allele1: AGTGATGTTGTCAGGCATGGGAGAAGGGAAAATTTC—GGTGGGGCTGCAGTGTCTGAC 
allele2: S==ea-S- Sos Se5==5 === SSS 


Gsdmd** (1bpTins/1bpTins) 
allele1: AGTGATGTTGTCAGGCATGGGAGAAGGGAAAATTTCT TGGTGGGGCTGCAGTGTCTGAC 
allele2: AGTGATGTTGTCAGGCATGGGAGAAGGGAAAATTTCT TGGTGGGGCTGCAGTGTCTGAC 


Gsdma** (9@bpdel+3bpins/10bpdel+5bpins) 
allele1: AGTGATGTTGTCAGGCATGGGAGAAGGGAAAATTTCT—------------------—— 
allele2: AGTGATGTTGTCAGGCATGGGAGAAGGG--—-—— TGACCGTGGGGCTGCAGTGTCTGAC 


Wt gRNA G2 target region 
CGATGGGAACATT CAGGGCAGAGTGATGTTGT CAGGCATGGGAGAAGGGAAAATTTC 
:AspGlyAsnI leGLinGlyArgValMetLeuSerGlyMetGlyGluGlyLysIle... 


Gsdmd**®*® —(2bpATins/19bpdel) 
allele1: CGATGGGAACATTCAGGGCAGAGTGATGTTATGTCAGGCATGGGAGAAGGGAAAATTTC 
allele2: CGATGGGAACATTCAGG------------------- CATGGGAGAAGGGAAAATTTC 


Gsdmd*'°  (2bpATins/2bpATins) 
allele1: CGATGGGAACATTCAGGGCAGAGTGATGTTATGTCAGGCATGGGAGAAGGGAAAATTTC 
allele2: CGATGGGAACATTCAGGGCAGAGTGATGTTATGTCAGGCATGGGAGAAGGGAAAATTTC 


Gsdma*"*  (19bpde1/19bpdel) 


allele1: CGATGGGAACATTCAGG-----------------—— CATGGGAGAAGGGAAAATTTC 
allele2: CGATGGGAACATTCAGG-----------—----—-—— CATGGGAGAAGGGAAAATTTC 
gRNA G2/G9 


Gsdmd*"'  (G2:1bpTins/G9:5bpdel) 
allele1: CGATGGGAACATTCAGGGCAGAGTGATGTTTGTCAGGCATGGGAGAAGGGAAAATTTC 
allele2: AGTGATGTTGTCAGGCATGGGAGAAGGGAAAA---—- GGTGGGGCTGCAGTGTCTGAC 


Gsdmd*'?  (G2:203bpde1/G9: 4bpdel) 
allele: =sa—=SneSes=s aaa eee a 
allele2: AGTGATGTTGTCAGGCATGGGAGAAGGGAAAAT----GGTGGGGCTGCAGTGTCTGAC 


Extended Data Figure 8| Gsdmd ‘“ alleles. Gsdmd ‘~ animals used inthis _ highlighted in bold. Deleted bases are indicated by red hyphens. Inserted 
study were compound or homozygous F1 and F2 knockouts generated from _ nucleic acids are highlighted in red. 
mosaic FO founder and F1 crosses, respectively. gRNA target sequences are 
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Extended Data Table 1 | Bioinformatic analysis of ENU-induced SNVs present in IGL1351 pedigree 


Chromosome| Coordinate (Assembly version: GRCm38) | Amino Acid Change [Splice Position Polyphen Score| Polyphen Prediction | SIFT Score | SIFT Prediction MGI accession Id Gene Name Ref Base | Var Base | Amino acid position | Codon 
F | 75411276 Disrupted splicing 410 MGI:409282 Speg T A 
41 80593159, K->N 0.90 Possibly damaging 0.02 deleterious MGI:2146320 Dock10 T [eJ 339 TT 
4 151792385, V-2A 0.79 Possibly damaging 0.00 deleterious MGI:1914217. Edem3 T Cc 305 GTC 
4 190933113. NOK 0.00 Benign 0.28 tolerated MGI:1196310 Angel2 Cc A 402 AAC 
= 119246455 DG 4.00 Probably damaging 0.01 deleterious MGI:1338033 Spintt A G 340 GAT 
3 5401136 Y-2>C 4.00 Probably damaging 0.02 deleterious MGI:2137668 Zfx4 A G 2118 TAT 
3 94886227 Disrupted splicing 6 MGI:1098257, Psmb4. A aT 
3 4127672362 Q>R 0.96 Probably damaging 0.00 deleterious MGI:1918731 Alpk1 T Cc 4165 1G 
5 72338991 C>S 1.00 Probably damaging 0.00 deleterious MGI:4349451 Corin A b 673 ACA 
6 24734179 F2S 4.00 Probably damaging 0.00 deleterious MGI:1921659 Hyalé T Cc 37 Tc 
5d 25295145, R>C 0.98 Probably damaging 0.00 deleterious Pafah1b3 G A 215 ACG 
x 35727415 Disrupted splicing 4 MGI:2443952 Dpy1913 A G 
Zz 46761432 M->1 4.00 Probably damaging 0.14 tolerated MGI:2180307 Hps5 c ki 4113 CAT 
id 48867151 TA 0.12 Benign 0.23 tolerated MGI:1922038 E28 T Cc 827. GGT 
_ 55898243, Y->Stop N/A MGI:1338801 Cyfip1 Cc A 560. TAC 
7 4104960739 Disrupted splicing 9 MGI:3030504 Olfr670 iB A 
8 25020557, Disrupted splicing 3 MGI:1916992 Tm2d2 A G 
8 66512896 IV 0.00 Benign 0.07 tolerated MGI:1921669 Tktl2 A G 369 ATC 
8 122422123 Tl 0.14 Benign 0.04 deleterious MGI:2446486, MGI:5141853 117c,gm20388 c bi 3 ACC 
9 109274572 D>G 0.02 Benign 0.44 tolerated MGI:4354703 Fbxw14 i Cc 347 GTC 
10 117291188 C>R 4.00 Probably damaging 0.00 deleterious MGI:96902 Lyz1 A G 48 ACA 
1 9267565 S->P 1.00 Probably damaging 0.01 deleterious MGI:2388707 Abca13 T Cc 336 TCA 
4 70463590 N>K 0.03 Benign 0.90 tolerated MGI:3603821 Zmynd15. Cc A 426 AAC 
44 78165400 Rol 0.99 Probably damaging 0.03 deleterious MGI:1202880 Trafa Cc A 44 CCG 
4 110148903 Y>C 0.94 Possibly damaging 0.02 deleterious MGI:2386796 Abcad i Cc 428 GTA 
4 119483118 VA 0.04 Benign 0.18 tolerated MGI:1289196 Rnf213 T Cc 5010 GTC 
4 121147246 DE 0.97 Probably damaging 0.44 tolerated MGI:4920929 Tex19.1 7 A 443 GAT 
13 21287499 D>G 7.00 Probably damaging 0.00 deleterious Gpx5 T Cc 478 GTC 
13 38469634 S->P 0.02 Benign 0.21 tolerated Bmp6é T Cc 226 TCC 
14 27464301 E>G 0.06 Benign 0.16 tolerated MGI:1921694 Di4abbie A G 819 GAG 
44 30769820 VA 0.00 Benign 0.22 tolerated MGI:1859609 ‘Sfmbtt T Cc 47 GTA 
14 70145862 TA 0.99 Probably damaging 0.25 tolerated MGI:2444228 261030191 9rik T Cc 229 AGT 
15 13034240. A>E 0.93 Possibly damaging 0.00 deleterious Cdh6 G bi 778 TGC 
45 21237903 VM. 0.99 Probably damaging 0.00 deleterious ‘Cahi2 G A 75 GIG 
15 75864337, 1>N 0.98 Probably damaging 0.00 deleterious MGI:1916396 Gsdmd T A 105 ATT 
7 46726680 D->G 0.01 Benign 0.01 deleterious MGI:1202304 Gnmt T Cc 124 ATC 
18 47281302 K->E 0.23 Benign 0.69 tolerated MGI:1203727 Sema6a ¥ Cc 520 CTT 
18 50281778 W->R 0.00 Benign 0.96 tolerated MGI:2684939 Fam170a A Cc 164 TGG 
19 40607637, lV 0.00 Benign 1.00 tolerated MGI:1914840 Tetn3 T Cc 309 GAT 


Ref Base, reference base. Var Base, variant base. 
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Extended Data Table 2 | Adjusted P values of Fig. 6 


Group -Group | Adjusted p-value 
wt Casp11- 0.0002 
wt Gsdma*- 0.0002 
Casp11- |Gsdmd/- 0.287 
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Selective small-molecule inhibition 
of an RNA structural element 


John A. Howe!l*, Hao Wang'*, Thierry O. Fischmann", Carl J. Balibar', Li Xiao', Andrew M. Galgoci', Juliana C. Malinverni!, 
Todd Mayhood!, Artjohn Villafania’, Ali Nahvi*, Nicholas Murgolo', Christopher M. Barbieri!, Paul A. Mann!, Donna Carr, 
Ellen Xia’, Paul Zuck?, Dan Riley®, Ronald E.Painter!, Scott S. Walker!, Brad Sherborne!, Reynalda de J esus!, Weidong Pan!, 
Michael A. Plotkin', Jin Wu', Diane Rindgen!, John Cummings’, Charles G. Garlisi!, Rumin Zhang’, Payal R. Sheth’, 


Charles J. Gill', Haifeng Tang! & Terry Roemer! 


Riboswitches are non-coding RNA structures located in messenger RNAs that bind endogenous ligands, such as a specific 
metabolite or ion, to regulate gene expression. As such, riboswitches serve as a novel, yet largely unexploited, class of 
emerging drug targets. Demonstrating this potential, however, has proven difficult and is restricted to structurally 
similar antimetabolites and semi-synthetic analogues of their cognate ligand, thus greatly restricting the chemical 
space and selectivity sought for such inhibitors. Here we report the discovery and characterization of ribocil, a highly 
selective chemical modulator of bacterial riboflavin riboswitches, which was identified in a phenotypic screen and acts 
as a structurally distinct synthetic mimic of the natural ligand, flavin mononucleotide, to repress riboswitch-mediated 
ribB gene expression and inhibit bacterial cell growth. Our findings indicate that non-coding RNA structural elements 
may be more broadly targeted by synthetic small molecules than previously expected. 


Bacterial riboswitches are cis regulatory structural elements present in 
the 5’ untranslated region (UTR) of mRNAs that specifically bind to 
natural ligands and regulate gene expression’°. Riboswitches are 
composed of two functionally distinct domains: an aptamer ligand- 
binding domain, and an expression platform. Distinct families of ribo- 
switches bind specific metabolites, including amino acids, vitamins, 
glucosamine-6-phosphate, S-adenosylmethionine, thiamine pyropho- 
sphate, ions, and flavin mononucleotide (FMN)'®. Mechanistically, 
metabolite binding to the cognate riboswitch aptamer induces a con- 
formational change within the expression platform (hence referred to 
as a ‘riboswitch’) leading to altered expression of a gene (or genes) 
involved in the corresponding biosynthetic pathway**. Riboswitch- 
ligand interactions can therefore provide a negative feedback circuit 
to modulate metabolism. 

Small molecules that are structurally dissimilar to the natural ligand 
but which mimic its activity to selectively downregulate riboswitch- 
controlled biosynthetic genes essential for growth (herein referred to 
as a synthetic mimic) could serve as mechanistically novel therapeutic 
agents’ "*. Riboflavin (vitamin B.) biosynthesis serves as one poten- 
tially attractive metabolic pathway to apply this strategy in antibac- 
terial discovery'*. FMN riboswitches regulate gene expression of 
enzymes and transporters of riboflavin biosynthesis and uptake and 
are broadly conserved across bacterial pathogens yet are absent in 
vertebrates’*'®. Whereas bacteria, fungi, and plants either synthesize 
this essential vitamin de novo (Extended Data Fig. 1) or acquire it from 
their environment, humans lack the riboflavin biosynthetic repertoire 
and must consume it as part of their diet. Importantly, riboflavin 
serves as the immediate precursor to FMN, which is subsequently 
converted to flavin adenine dinucleotide (FAD), two essential cofac- 
tors for a diverse set of flavoenzymes central to intermediary meta- 
bolism. Although FMN riboswitches can bind riboflavin and FAD, 
FMN is the cognate ligand with highest affinity to its receptor to 


control riboflavin biosynthesis’'’”*. Therefore, disrupting riboflavin 
biosynthesis either by synthetic mimics of FMN or enzyme inhibitors 
of the pathway represents a novel target for antibacterial intervention 
with limited predicted off-target effects. 

To date, little success has been achieved in identifying synthetic 
mimics of riboswitch ligands despite using a variety of in vitro screening 
strategies, including affinity-based or fragment-based screening, or 
structure-guided design approaches starting with natural ligands’. 
One of the best-characterized antimetabolite inhibitors to a riboswitch 
is roseoflavin’’, a naturally-produced analogue of riboflavin, which 
targets FMN riboswitches from multiple bacterial species'’”*'*"°. Like 
riboflavin, roseoflavin is converted to the flavoenzyme cofactor analo- 
gues, roseoflavin mononucleotide (RoFMN) and roseoflavin adenine 
dinucleotide (RoFAD)’*. In Bacillus subtilis, roseoflavin resistant 
mutants map either to the ribG FMN riboswitch or the FAD synthetase 
gene, ribC’’®. Accordingly, the antibacterial effect of ROFMN results 
from both inhibition of the FMN riboswitch leading to reduced rib gene 
expression”, as well as inhibiting as many as 40 bacterial flavoenzymes”’, 
of which many are conserved in humans*’”’. Therefore, roseoflavin or 
other riboflavin derivatives pose significant potential side-effects as 
antibacterials and lack the necessary target-selectivity as chemical 
probes for chemical biology studies. 

Here we report a phenotypic screen and the discovery of ribocil, a 
highly specific bioactive synthetic mimic of FMN, which competes 
with the natural ligand to inhibit FMN riboswitch-mediated express- 
ion of ribB and inhibits bacterial growth. 


Riboflavin pathway validation and discovery of ribocil 

Riboflavin is essential for microbial growth, however, bacterial genes 
involved in riboflavin biosynthesis are conditionally essential as they 
are indispensable under conditions where external riboflavin is 
absent, but not required under conditions where exogenous riboflavin 
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is present and the vitamin can be acquired by either active or passive 
transport mechanisms*'®**°, Accordingly, we first examined the 
pathogenicity of Escherichia coli strains deleted of ribA or ribB, which 
correspond to early steps in riboflavin biosynthesis but are viable 
when grown in the presence of 20 uM riboflavin supplementation 
(Extended Data Fig. 1). In a murine septicaemia model, E. coli 
AribA and AribB mutants are highly attenuated in their virulence, 
yielding approximately a 3 X log) reduction in bacterial burden ver- 
sus the wild-type isogenic control strain at all infectious doses tested 
(Fig. 1a). Notably, whereas wild-type E. coli causes significant mor- 
tality at the higher infectious doses tested, mice failed to show signs of 
morbidity or mortality with AribA and AribB strains infected at sim- 
ilar or higher infectious inoculates. These results provide genetic 
evidence that E. coli is unable to scavenge sufficient riboflavin in a 
relevant infection setting and predict that selective inhibitors of ribo- 
flavin biosynthesis could display antibacterial efficacy. 

We screened an internal library of ~57,000 synthetic small mole- 
cules with antibacterial activity to identify compounds whose growth 
inhibitory activity against E. coli recapitulates the conditional essen- 
tiality of the riboflavin biosynthetic pathway. To enhance the oppor- 
tunity of identifying such molecules, a suitable E. coli strain (MB5746) 
defective in wild-type lipopolysaccharide (LPS) levels and drug efflux 
was selected’’”. One compound resulting from this phenotypic screen, 
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Figure 1 | Genetic and pharmacological inhibition of riboflavin 
biosynthesis. a, Attenuated virulence of AribA (red) and AribB (blue) mutants 
versus isogenic parent E. coli strain MB5746 (black) across 3 X log escalation 
in infectious dose. Colony forming units per gram of spleen were determined 
at 24h after infection and are reported as an average (n = 5 mice) + s.e.m. 
Numbers denote mouse survival at 24h time point. b, Chemical structure of 
ribocil. c, Whole-cell screening assay to identify riboflavin (RF) inhibitors. 

In the absence of exogenous riboflavin, ribocil demonstrates a clear zone of 
growth inhibition when spotted (twofold dilution series) on an agar plate 
seeded with MB5746. Supplementing the plates with 20 11M riboflavin fully 
suppresses ribocil bioactivity. The figure is representative of three independent 
experiments. d, Dose-dependent depletion of cellular riboflavin levels (AUC, 
area under the curve) by HPLC analysis following ribocil treatment of MB5746 
(ribocil ICs9 = 0.3 UM). The data in the figure is the average of two technical 
repeats (+s.d.) and is representative of three independent experiments. e, Heat 
map summary of mutations (in red) identified in MB5746 by illumina-based 
whole-genome sequencing (>100 X genome coverage) and verified by 
Sanger sequencing for 19 independently isolated ribocil® mutants. All base 
changes are listed in the left column and map to the FMN riboswitch. Black; 
no change versus the parental genome sequence. 
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named ribocil (Fig. 1b) demonstrated complete suppression of its 
bioactivity specifically in the presence of exogenous riboflavin 
(Fig. 1c). As the conditional growth inhibitory activity of ribocil is a 
phenocopy of E. coli Arib mutants, it was further investigated as a 
potential target-specific inhibitor of the riboflavin biosynthetic path- 
way. Corroborating this view, riboflavin levels in E. coli ribocil-treated 
cells revealed a dose-dependent depletion of riboflavin (ICs9 = 0.3 uM; 
Fig. 1d), as well as reduced levels of FAD and FMN (Extended Data 
Fig. 1), mirroring the phenotype of AribA and AribB mutants 
(Extended Data Fig. 1). 


Ribocil target identification and mechanism of action 


To identify the corresponding cellular target of ribocil, we isolated 
multiple (1 =19) independently derived E. coli ribocil-resistant 
(ribocil®) mutants and mapped drug resistant mutations to their tar- 
get through whole-genome sequencing (WGS) (Fig. le). Remarkably, 
all drug-resistant mutants contain base pair changes mapping to the 
FMN riboswitch within ribB (Fig. 2a, b). Notably, none of the iden- 
tified mutations act as compensatory (that is, bypass) mutations con- 
ferring drug resistance by dramatically inducing elevated expression 
of riboflavin either in the presence or absence of ribocil treatment 
(Extended Data Fig. 1). As WGS failed to detect any additional muta- 
tions in all ribocil® isolates, these data firmly implicate the FMN 
riboswitch mutations as causal for ribocil resistance (Fig. le). The 
biological significance of this finding is underscored by the essential 
role of the FMN riboswitch in mediating transcriptional and trans- 
lational control of the riboflavin pathway”, which controls ribB 
expression via a negative feedback loop in concert with its effector 
ligand, FMN (Fig. 2a). Based on the functional role of the FMN 
riboswitch and that all ribocil* mutations map to this regulatory 
element, we hypothesized that ribocil inhibits riboflavin biosynthesis 
directly by mimicking the FMN ligand and binding to the FMN 
riboswitch to inhibit ribB expression. 

Direct demonstration that ribocil specifically targets the riboflavin 
riboswitch was initially achieved using a plasmid-based GFP reporter 
gene under the control of E. coli ribB promoter and FMN riboswitch 
5’ sequence and whose expression is monitored in a ribocil® isolate, 
where a ribocil-mediated effect on GFP expression can be quantified 
without inhibiting cell growth (Extended Data Fig. 2). Ribocil strongly 
inhibits GFP expression, achieving a 50% effective concentration 
(ECs) of 0.3 uM (Extended Data Fig. 2) and paralleling ribocil’s 
50% inhibitory concentration (ICs9) to impair riboflavin synthesis 
(Fig. 1d). Similarly, ribocil effectively inhibited expression of the 
GFP reporter when regulated by orthologous riboswitches from 
Pseudomonas aeruginosa or Acinetobacter baumannii, albeit at 
approximately fivefold higher ECs9 values than against the E. coli 
riboswitch (Extended Data Fig. 2). Conversely, GFP reporter con- 
structs harbouring E. coli ribocil® riboswitch mutations are largely 
non-responsive to ribocil inhibition (Extended Data Fig. 3), corrob- 
orating a direct interaction between ribocil and the riboswitch. 

As ribocil lacks potent whole-cell activity against A. baumannii and 
P. aeruginosa (see below), classical drug resistance selection studies 
could not be performed against such organisms. To overcome this 
issue, we constructed surrogate E. coli strains in which the native 
chromosomal FMN riboswitch element regulating ribB was replaced 
with a functional ABFMN or PaFMN riboswitch. Ribocil™ selections 
in either heterologous strain background identified 11 and 19 ribocil® 
mutations causing unique single nucleotide changes or deletions 
within the ADFMN and PaFMN riboswitch, respectively (Extended 
Data Fig. 2). Finally, the frequency of resistance (FOR) to ribocil 
was quantified against E. coli MB5746 and FMN riboswitch recom- 
binant strains (Extended Data Table 1). Whereas the E. coli parent 
(FOR = 2.4 X 10 °) and PaFMN recombinant (FOR = 6.4 X 10°”) 
displayed significant target-based resistance, the ADFMN riboswitch 
variant yielded an approximately 80-fold lower (FOR = 3.3 X 10° *) 
resistance rate compared to the parental strain. Collectively, 
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Figure 2 | FMN riboswitch and ribocil® mutation mapping. a, Schematic 
representation of the FMN riboswitch containing the 5’ mRNA FMN 
aptamer which binds FMN and the 3’ expression platform which directly 
coordinates expression of the downstream ribB gene open reading frame 
(ORF). Left, FMN aptamer binding to the FMN ligand transduces a 
conformational change in the expression platform, resulting in the formation 
of a terminator/sequester loop that abolishes ribB expression (OFF) in a 

dual manner: (1) early termination of transcription of the ribB ORF; and 

(2) sequestering the Shine-Dalgarno ribosome binding sequence (RBS) to 
prevent translation of fully transcribed ribB mRNAs”. Alternatively, in the 


these genetic data strongly suggest that ribocil directly inhibits FMN 
riboswitch-mediated ribB expression across FMN riboswitches span- 
ning phylogenetically diverse bacteria. 

Direct ligand binding of ribocil to the E. coli FMN riboswitch was 
obtained by biophysical means using the synthesized FMN aptamer 
(Extended Data Fig. 3). To first characterize the structural integrity of 
the purified aptamer, FMN binding was quantified using spectro- 
scopy to monitor quenching of the intrinsic fluorescence of FMN 
upon complex formation with RNA*”*. The binding affinity Ky value 
obtained for FMN binding to the E. coli FMN riboswitch aptamer 
(Kg = 1.2 nM), other kinetic parameters, and divalent cation (Mg’*) 
requirement were found to be similar in range to data reported 
for FMN riboswitches from a wide variety of bacterial species 
(Extended Data Fig. 3)'*°. Addition of ribocil in this FMN fluor- 
escence quenching assay restores FMN fluorescence in a potent and 
dose-dependent manner (Ky = 16 nM), demonstrating that ribocil 
directly competes with FMN for binding to the RNA aptamer 
(Extended Data Fig. 3). 


Structure of ribocil-riboswitch RNA aptamer 

Whereas we were unsuccessful in co-crystalizing the E. coli FMN 
aptamer with ribocil, a 2.95 A structure of ribocil bound to the 
Fusobacterium nucleatum impX FMN riboswitch aptamer was 
achieved (Fig. 3a, b and Extended Data Fig. 4). The primary sequence 
of the F. nucleatum aptamer conforms well to the E. coli sequence as 
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absence or depletion of FMN (right), the FMN aptamer adopts a second 
structural conformation (ON) that induces an anti-terminator/anti-sequester 
loop, thereby facilitating uninterrupted ribB expression. In this way, the FMN 
riboswitch serves as a highly tunable negative feedback system to control 
riboflavin biosynthesis through the regulation of ribB expression. b, Predicted 
secondary structure of the E. coli FMN riboswitch and mapping of ribocil® 
mutations. Initial secondary structure predictions were derived by global 
alignment of E. coli, A. baumannii, P. aeruginosa and F. nucleatum FMN 
riboswitches (Extended Data Fig. 5). Ribocil® mutations are indicated in red. 
Green boxes indicate predicted key contacts with FMN in the E. coli riboswitch. 


well as the riboswitch family consensus sequence (Extended Data 
Fig. 5) and to date is the only FMN riboswitch aptamer for which a 
crystal structure has been reported”’. In the ribocil-F. nucleatum apta- 
mer co-crystal, ribocil adopts a constrained U-shaped conformation 
(Extended Data Fig. 4) and, like FMN", is positioned inside the junc- 
tional region of the six RNA stems (Fig. 3a). Within the left arm of 
ribocil, the 6-thiophenyl-pyrimidonyl group stacks face-to-face 
between A48 and A85 and edge-to-face with A49 (Fig. 3b). A key 
H-bond also forms between the ribocil pyrimidonyl oxygen and A48 
and A99. Overall, the left arm of ribocil binds in a similar conformation 
as the planar isoalloxazine ring system of FMN (Extended Data Fig. 4), 
which adopts a continuous stacking alignment bridging the P6 and P3 
helices of the riboswitch’*. Further, ribocil’s right arm methyl group at 
position 5 in the piperidinyl acts as a weak H-bond donor with G11 
while the methylamino-pyrimidinyl stacks face-to-face with G62 
(Fig. 3b). Notably, both G11 and G62 are conserved guanines that form 
key interactions with the phosphate group of FMN”. 

Although a racemic mixture of (S) and (R) enantiomers was 
used for co-crystallization, electron density mapping of ribocil in 
the co-crystal structure predicts that only the (S)-isomer is bound 
(see Methods and Extended Data Fig. 4). Separation of the ribocil 
mixture led to isolation of the isomers named ribocil-A and ribocil-B. 
Ribocil-B demonstrated superior microbiological activity as compared 
to ribocil-A (minimum inhibitory concentration (MIC) = 1 pg ml! 
versus MIC = 641g ml !; Extended Data Table 1), inhibition of 
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Figure 3 | X-ray crystal structure of ribocil bound to the F. nucleatum FMN 
riboswitch. a, Overall structure of the ribocil-bound F. nucleatum FMN 
riboswitch aptamer. The aptamer is represented as a cartoon and coloured 
by P-loop domain as labelled. The ligand is represented as spheres, with methyls 
coloured in slate blue, oxygen and nitrogen atoms are in red and blue, 
respectively. b, The RNA and ligand structures are represented as sticks. The 
carbon atoms coloured in orange for the RNA and slate blue for the ligand. 
The solvent-accessible surface is show in grey. The surface is in dark grey where 
it faces up. Several bases in the vicinity of the ligand are labelled. A85 is 
positioned on top of the compound in the figure orientation. Polar interactions 
are shown with red dashes, and an interaction where a methyl acts as weak 
H-bond donor is drawn as a black dashed line. 


riboflavin synthesis (ICs) = 0.13 4M versus IC; > 26 1M; Extended 
Data Fig. 3), and binding affinity to the E. coli FMN aptamer 
(Kg=66nM versus Kg=10,000nM, Extended Data Fig. 3). 
Together these results provide compelling evidence that the biologically 
active ribocil-B isomer is the (S)-isomer bound in the co-crystal. 
Analysis of E. coli ribocil® mutations (Fig. 2b, Extended Data Fig. 5 
and Supplementary Video) predicts that FMN aptamer mutant A94- 
102 directly disrupts ribocil binding by the deletion of a key nucleotide 
contact (G96) with the compound. This is based on bacterial FMN 
aptamer homology models which highlight G96 as being equivalent to 
the G62 nucleotide in the F. nucleatum FMN aptamer (Extended Data 
Fig. 5) and that this guanine stacks with the right arm methylamino- 
pyrimidinyl portion of ribocil in the co-crystal structure (Fig. 3b). 
Corroborating this view, the ECFMN-GFP fusion containing the 
A94-102 allele is unaffected by ribocil (ECs) > 200 UM) (Extended 
Data Fig. 3). Alternatively, four other ribocil* FMN aptamer muta- 
tions (C33U, G37U, G93U and C111U) do not directly impair direct 
contacts with ribocil but probably disrupt Watson-Crick base pairing 
or tertiary contacts within the FMN riboswitch structure. As such, 
each of these mutations probably induces a conformational change 
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within the riboswitch sufficient to effectively reduce ribocil binding 
affinity. Indeed, all ECFMN-GFP fusions containing these alleles also 
demonstrate significantly reduced ribocil binding (ECs» = 200 1M) 
yet preserve sufficient functional expression of ribB to maintain cell 
growth (Extended Data Fig. 3). Alternatively, ribocil effectively inhi- 
bits expression (ECs 9 = 0.2 UM) of the ECFMN-GFP fusion contain- 
ing the ribocil® mutant (C100U), demonstrating that ribocil binding 
was unaffected and resistance must be conveyed by an alternative 
mechanism (see below). U218C and C219U map to the expression 
platform rather than the FMN aptamer (Fig. 2b) and probably 
uncouple FMN-mediated repression of ribB expression by disrupting 
the requisite base pairing within the sequestration loop to expose the 
Shine-Delgarno sequence for constitutive ribB expression. Consistent 
with this possibility, ribocil ECs, values against ECFMN-GFP con- 
taining these mutations are similar to the wild-type ECFMN-GFP 
fusion (ECs9 = 0.2-2.0 uM) (Extended Data Fig. 3). Although 
C100U, U218C and C219U bind ribocil with ECs, values similar to 
the wild-type riboswitch, these mutations probably confer ribocil res- 
istance because riboflavin levels are not suppressed sufficiently to 
inhibit cell growth (Extended Data Fig. 1). Notably, all ribocil® muta- 
tions are markedly cross-resistant to roseoflavin (Extended Data 
Fig. 6), consistent with ribocil and roseoflavin competitively binding 
to the FMN aptamer. 

Ribocil specifically displays microbiological activity against 
E. coli MB5746 maintaining either the native FMN riboswitch 
(MIC =2,1gml~') or orthologous FMN aptamers (MIC range 
1-16 yg ml) (Extended Data Table 1) and lacks activity against 
yeasts and human cells which lack the cognate target (Extended 
Data Table 1; see Methods). Although ribocil is effectively effluxed 
in E. coli (Extended Data Table 1), inactivity against yeasts is not 
mediated by efflux (Extended Data Table 1). Ribocil is also more 
potent than roseoflavin (MIC>128pgml~') against MB5746 
(Extended Data Table 1) and, unlike roseoflavin, ribocil bioactivity 
is fully suppressed by exogenous riboflavin (Extended Data Fig. 6). 

To evaluate the effects of inhibiting FMN riboswitch function in a 
murine E. coli septicaemia model of infection, an (S) enantiomer 
analogue of ribocil named ribocil-C that is mechanistically equivalent 
to ribocil (Extended Data Fig. 3) and displays eightfold improved 
antibacterial activity (Extended Data Table 1) was tested. MB5746 
bacterial burden in sham-treated mice and low-dose ribocil-C-treated 
groups (30 mg kg | ribocil-C) ranged from 9-9.5 log;[colony-form- 
ing units (CFU) per g spleen] (Fig. 4a,b). Conversely, higher dose 
ribocil-C treatment groups (60 and 120 mg kg! ribocil-C) demon- 
strated a dose-dependent reduction in bacterial burden of 1.87 and 
3.29 logio[CFU per g spleen] reduction respectively versus sham- 
treated mice, without mortality or gross effects of toxicity observed 
(Fig. 4). These data provide a pharmacological demonstration that 
riboswitch-mediated inhibition of riboflavin biosynthesis is effi- 
cacious in a murine systemic infection model. 


Discussion 


As a synthetic mimic, ribocil demonstrates high affinity and shares 
critical contacts to the FMN binding site sufficient to effectively com- 
pete with the natural ligand for binding to the riboswitch aptamer and 
disrupt gene expression. Ribocil elicits its antimicrobial effects in a 
highly target-selective manner (Supplementary Discussion) and 
demonstrates that gene regulatory riboswitches are ‘druggable’ by syn- 
thetic chemistry. Ribocil also illustrates how such a synthetic mimic 
demonstrates superior selectivity in its inhibitory effects on whole cells 
compared to roseoflavin or other antimetabolite ligands. For example, 
roseoflavin not only targets the FMN aptamer but FAD synthetase”’, as 
well as multiple functionally diverse flavoproteins that require FMN as 
a cofactor and that are conserved in humans*!”®. Conversely, ribocil 
specifically targets bacterial FMN riboswitches and (unlike roseofla- 
vin) its growth-inhibitory activity is completely suppressed by the 
addition of exogenous riboflavin (Extended Data Fig. 6); a condition 
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Figure 4 | In vivo activity of ribocil in a murine systemic infection model of 
E. coli. a, DBA/2J mice were infected by intraperitoneal injectionwith E. coli 
strain MB5746 (5.0 X 104 CFU per mouse) and treated by subcutaneous 
injection with ribocil-C or ciprofloxacin at the indicated doses (mg kg *) three 
times over a 24 h infection period. Spleens were aseptically collected from 
five mice per group and the reduction of log[CFU per g spleen tissue] was 
calculated on the basis of bacterial burden in spleens of the vehicle-treated (10% 
DMSO) control group. Data represents the average CFU per g spleen + s.e.m. 
One-way ANOVA with Dunnett’s multiple comparison test demonstrates 
statistically significant (***P < 0.001) log{CFU per g] reductions in the 60 and 
120mgkg ' ribocil-C treatment groups and ciprofloxacin control. b, The CFU 
data from each mouse plotted as individual points, solid bar and error bars 
represent the average CFU per g spleen + s.e.m. One-way ANOVA with 
Dunnett’s multiple comparison test demonstrates statistically significant 
(***P < 0.001) log{CFU per g] reductions in the 60 and 120 mg kg * ribocil-C 
treatment groups and ciprofloxacin control. Similar efficacy was observed 
independently at a tenfold higher infectious inoculum (Extended Data Fig. 8). 


only expected if ribocil does not additionally inhibit flavoproteins or 
other essential proteins. Notably, roseoflavin (and presumably other 
riboflavin antimetabolites) normally requires a highly specific cognate 
transporter to enter cells that is absent among Gram-negative bac- 
teria‘**°, This complicates roseoflavin or similar antimetabolites 
achieving antibacterial activity against the most serious class of 
drug-resistant bacterial pathogens*’. As a synthetic mimic, however, 
ribocil is not dependent on a riboflavin transport system to enter 
bacterial cells and inhibit growth. 

Phenotypic screens have historically proven highly successful, 
prompting the renewed interest in them, particularly in the areas of 
infectious disease and oncology*”*’. Notwithstanding potential lim- 
itations to the phenotypic screen we describe here (Supplementary 
Discussion), several advantages to such a screening approach are 
noteworthy. Importantly, all targets constituting the pathway of inter- 
est are screened in an agnostic fashion for cognate inhibitors. Here, 
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rather than identifying enzyme inhibitors to any of the steps in the 
riboflavin pathway, ribocil with its unique mechanism of action 
was identified, exemplifying the value of unbiased phenotypic 
screens. Importantly, a robust demonstration that target-specific 
inhibitors are identified in the screen is simply achieved by reversing 
their bioactivity in a pathway-specific manner; namely, supplement- 
ing drug-treated cells with riboflavin. Indeed, suppression of small- 
molecule bioactivity, whether achieved genetically**** or through 
metabolite supplementation”, provides a straightforward means to 
unambiguously identify bioactive pathway-specific inhibitors. Lastly, 
performing a chemical similarity search for roseoflavin-like molecules 
within the screening library revealed several riboflavin analogues 
not identified in the phenotypic screen. Although all these com- 
pounds demonstrate potent antibacterial activity and inhibit ribo- 
flavin, FMN and FAD synthesis, none of these compounds are 
effectively suppressed by riboflavin (Extended Data Fig. 7). Like 
roseoflavin, we conclude that the growth-inhibiting effects of these 
riboflavin analogues are also unselective, highlighting the effective- 
ness of the phenotypic screen to uniquely identify bona fide synthetic 
mimics of FMN. 

Finally, our work suggests ‘druggable’ targets be considered more 
broadly from how they are traditionally viewed, which is generally 
defined as single enzymes, receptors or structural proteins susceptible 
to chemical inhibition. Indeed, the diverse modes of action that anti- 
bacterial agents exhibit emphasizes that therapeutic small molecules 
interdict substantially more complex targets that are often only rel- 
evant in a cellular context”. Although our study emphasizes that 
target-based drug resistance may compromise the suitability of 
FMN riboswitches as antibacterial targets, we caution that it is pre- 
mature to generalize this conclusion across other classes of ribos- 
witches. Indeed, drug resistance may be mitigated by targeting 
genus-specific riboswitches (Supplementary Discussion) with the 
added benefit that such antimicrobials may minimally impact gut 
microbiota**. Finally, human genetic disorders including fragile X 
syndrome, myotonic dystrophy, and Huntington disease, all result 
from expanded nucleotide repeats located in transcripts which fold 
into stable RNA hairpin structures and result in impaired splicing 
and/or translation*’. As such, these disease-causing RNA elements 
represent a completely novel class of drug targets to which significant 
drug resistance would not be predicted from somatic cells. It is also 
estimated that the human transcriptome comprises thousands of non- 
coding RNAs which also adopt highly specific secondary structures of 
potential functional and disease-causing significance**’. Thus, RNA 
structural elements may substantially expand our view of the target 
space susceptible to therapeutic intervention. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. The experiments 
were not randomized. 

Microbiological studies. MB5746 is an antibiotic-sensitized E. coli strain har- 
bouring an envA1 mutation and tolC deletion’. Consequently, MB5746 is outer 
membrane hyper-permeable and efflux deficient. The primary screen was per- 
formed in 1,536-well plate format. MB5746 (1:10,000 dilution of an overnight 
culture 2-4 X 10? CFU ml‘) was grown in cation-adjusted Mueller Hinton 
broth (CAMHB; BD BBL, cat. no. 212322), with or without 10 uM riboflavin 
(riboflavin; Sigma, cat. no. 4500-100G) supplementation. The final medium 
volume in each well was 5 pl, to which 50 nl DMSO containing twofold-titrated 
compound was transferred. Hit compounds whose activity was suppressed by 
riboflavin were retested with CAMH agar containing MB5746 (1:1,000 dilution of 
overnight culture). MICs were determined by the broth microdilution method as 
recommended by the Clinical and Laboratory Standards Institute with one excep- 
tion: bacterial strains were tested in M9 broth. 

Isolation of E. coli ribocil resistant mutants. MB5746 was grown to late-expo- 
nential phase in CAMHB and spread on CAMH agar plates (BD BBL cat. no. 
211438) containing twofold escalating agar MIC levels of ribocil. To establish 
the number of viable cells in the starting inoculum, the culture was serially 
diluted and plated on CAMH agar plates lacking ribocil. Resistant isolates were 
re-streaked on plates containing the same ribocil concentration. The frequency of 
resistance (FOR) was determined, dividing the number of resistant isolates by the 
viable CFU in the late-exponential inoculum. 

Deletion of E. coli ribA and ribB. ribA and ribB were knocked out in strain 
MB5746 by A-Red recombineering using the linear PCR product generated by 
amplification from pKD4 (ref. 2) with the primer pairs P1/P2 or P3/P4, respect- 
ively. Transformation and selection on kanamycin were performed as described 
previously“, except that 500 jg ml riboflavin was added to the selection plate to 
maintain the auxotrophic mutants. Upon obtaining strains MB5746AribA::kan 
and MB5746 AribB::kan, riboflavin growth assays were performed to deter- 
mine the minimal concentration of riboflavin required to maintain growth 
of the auxotrophic mutant. Liquid cultures of CAMHB inoculated with 
1 X 10° CFU ml * of the mutants grown overnight on solid media were supple- 
mented with a twofold dilution series of riboflavin ranging from 250 pg ml to 
0.25 jig ml‘. After 24h of incubation at 37 °C growth was scored visually and it 
was determined that as little as 0.5 1gml~’ was sufficient to maintain some 
growth of the auxotrophic mutants, but optimal growth was observed with a 
minimum of 4 1g ml riboflavin. 

RNA preparation of the E. coli wild-type FMN aptamers. DNA templates for 
in vitro transcription of aptamer RNA were prepared by PCR from the pCDF- 
EcFMN-GFP reporter plasmid using a forward primer ApT7 (TAATACGACTC 
ATTATAGGgcttattctcagggcg) incorporating the T7 promoter and a reverse pri- 
mer ApRev (cgttactctctcccatccg). Uppercase represents additional sequences 
added in the primer including the T7 promoter, lowercase represents the ribos- 
witch sequence. In vitro RNA aptamer transcription was carried out using the 
RiboMAX large scale RNA production system kit (P1300, Promega) using the 
protocols provided by the manufacturer. After extraction with phenol/cholor- 
form the RNA aptamers were further purified by column chromatography on 
NAP-10 sephadex columns (GE Healthcare) and isopropanol precipitation. 

In vivo evaluation of E. coli AribA and AribB virulence. E. coli MB5746 
and deletion mutants (AribA, AribB) were reconstituted in 10 ml trypticase 
soy broth (TSB, Corning, cat. no. 46-060-CI). The mutant strains were supple- 
mented with 2.5 1g ml * riboflavin (Sigma, cat. no. R4500-100G) and incubated 
at 35 °C for 6 h with shaking at 250 r.p.m. The respective 6-h cultures were used to 
seed, at a ratio of 1:50 ml, TSB either with or without 2.5 pg ml! riboflavin in a 
250 ml flask and were incubated at 35°C for 16 h with shaking at 250r.p.m. 
Overnight cultures were centrifuged at 5,000r.p.m. for 12min at 5°C. 
Supernatant was decanted and pellets were re-suspended in 50 ml fresh TSB to 
remove excess riboflavin. Six tenfold serial dilutions were made in TSB from these 
stock cultures (1.2 X 10'° CFU ml“! for wild type, 5.7 X 10° CFU ml™! for AribA, 
3.6 X 10° CFU ml! for AribB). Select serial dilutions were further diluted 1:10 
into 3% gastric hog mucin for intra-peritoneal (i.p.) injection into mice. The 
initial dilutions without mucin were plated for quantification on TSA II 
(5% sheep’s blood) agar plates (BD BBL, cat. no. 221261) for the wild-type strain 
or 10pgml~’ riboflavin-infused Muller Hinton II Agar (BD BBL, cat. no. 
211438) plates for mutant strains. 

Eleven-week-old female DBA2/J mice (Jackson Labs) were chosen for this 
study based on weight (~20 g) and combined into one pool. Animals were then 
randomly selected from this pool and placed in groups of five in separate boxes. 
Subjects were treated with 150mg kg ' ip. cyclophosphamide (Baxter, NDC# 
10019-955-50) on day —4 and 100 mg kg * on day —1. On day 0, five mice per 


group were injected ip. with 0.5 ml of a respective dilution of bacteria in 3% 
mucin (6.0 10°, 10°, 10 CFUml™' for wild type, 2.8510’, 10°, 
10°CFU ml? for AribA, 1.8 X 10’, 10°, 10°CFU ml" for AribB). On day 1, 
subjects were euthanized via CO, asphyxiation and spleens were aseptically 
removed, weighed and homogenized in 1.5 ml of sterile saline (Hyclone, cat. 
no. SH30028.03) with 10% glycerol (Fisher Scientific, cat. no. BP229-1). Tissue 
homogenates were serially diluted tenfold in sterile saline and selected concen- 
trations were plated on either TSA II (5% sheep’s blood) agar plates for the wild 
type or MH riboflavin infused agar plates for AribA and AribB mutants. Plates 
were incubated at 35 °C for 24h and CFU per g of spleen tissue were determined. 
No data was excluded from this study and investigator blinding was not imple- 
mented during this study. This study was approved and was in compliance with 
the ethical regulations set forth by the Institutional Animal Care and Use 
Committee ([ACUC) at Merck Research Laboratories, Kenilworth, New Jersey. 
Antibacterial effect of ribocil-C in a mouse septicaemia model. E. coli MB5746 
was reconstituted in 10 ml trypticase soy broth (TSB, Corning, cat. no. 46-060-CI) 
and incubated at 35 °C for 6 h with shaking at 250 r.p.m. The 6 h culture was used 
to seed, at a ratio of 1:50 ml, TSB in a 250 ml flask and were incubated at 35 °C for 
16h with shaking at 250r.p.m. The overnight culture was centrifuged at 
5,000 r.p.m. for 12 min at 5°C. Supernatant was decanted and the pellet was 
re-suspended in 50 ml fresh TSB to remove excess riboflavin. Nine tenfold serial 
dilutions were made in TSB from the culture (1.0 X 10'° CFU ml '). The third 
dilution (1.0 X 10’ CFU ml“ ') was further diluted into 3% gastric hog mucin for 
ip. injection into mice. The initial dilutions without mucin were plated for quan- 
tification on TSA II (5% sheep’s blood) agar plates (BD BBL, cat. no. 221261). 
Twelve-week-old female DBA2/J mice (Charles River Laboratory) were chosen 
for this study based on weight (~20 g) and combined into one pool. Animals were 
then randomly selected from this pool and placed in groups of five in separate 
boxes. Subjects were treated by intraperitoneal (i.p.) injection with 150 mg kg’ of 
cyclophosphamide (Baxter, NDC# 10019-955-50) on day —4 and 100mg kg 
on day —1. On day 0, mice were inoculated ip. with 0.5 ml of bacteria in 3% 
mucin (5.0 X 10* CFU ml‘; Fig. 4) or a higher inoculum of 5.0 X 10° CFU ml * 
(Extended Data Figure 8). Thirty minutes post-inoculation, mice (n =5 per 
group) were treated by subcutaneous (s.c.) injection three times over 24h with 
either ciprofloxacin (0.5 mg kg’, Sigma Aldrich, cat. no. 17850-5G-F, ribocil-C 
(at either 120, 60, or 30 mg kg ~ ')) or 10% DMSO (Sigma Aldrich, cat. no. 276855- 
1L) sham. On day 1, subjects were euthanized via CO, asphyxiation and spleens 
were aseptically removed, weighed and homogenized in 1.5 ml of sterile saline 
(Hyclone, cat. no. SH30028.03) with 10% glycerol (Fisher Scientific, cat. no. 
BP229-1). Tissue homogenates were serially diluted tenfold in sterile saline and 
selected concentrations were plated on TSA II (5% sheep’s blood) agar plates. 
Plates were incubated at 35°C for 24h and CFU per g of spleen tissue were 
determined. A normality test was performed to verify normal distribution of data 
before determining statistical significance via the one-way Bonferroni ANOVA. 
No data was excluded from these studies and investigator blinding was not 
implemented during this study. This study was approved and was in compliance 
with the ethical regulations set forth by the Institutional Animal Care and Use 
Committee ([ACUC) at Merck Research Laboratories, Kenilworth, New Jersey. 
HPLC-based quantitative analysis of flavins. Overnight cultures of MB5746 or 
MB5746 Ribocil® cells were diluted 1:50 in CAMHB and distributed (1.25 ml) 
into 10-ml culture tubes containing diluted ribocil (twofold dilution series) or 
DMSO (1%) as mock control. The treated cultures were incubated with shaking at 
37 °C for about 20h, after which the OD goo of the culture was determined and 500 
ul was moved to a 96-well deep-well plate. After centrifugation (4,000 r.p.m.) for 
10 min, the bacterial cell pellets were rinsed with lysozyme dilution buffer (10 mM 
Tris HCl (pH 8.0), 25mM NaCl, 1mM EDTA) and centrifuged again. Cell 
pellets were then re-suspended in 100 pl of lysozyme solution (10 mg ml”! 
lysozyme (Sigma) in lysozyme dilution buffer), incubated at 37°C for 30 min, 
and then frozen at —20°C. Riboflavin, FMN and FAD concentrations in the 
bacterial lysates were determined using the Vitamin B2 HPLC detection kit 
(ImmuChrom, GmbH) and Vitamin B2 column (IC2300rp, ImmuChrom 
GmbH) following the procedure recommended by the manufacturer scaled 
for a 50 pl sample (bacterial lysate). A Shimadzu HPLC system with fluorescence 
detector was used at a flow rate of 1.0ml min™! and flavin detection was 
carried out at 450 nm. Flavin levels were determined for an equivalent number 
of cells by correcting raw AUCs using the ODgop ratio of the treated versus the 
untreated cultures. 
Oligonucleotides used for reporter and strain construction. P1:tatggcaaaat 
aagccaatacagaaccagcattatctggagaatttcatgstgcaggctgeagctgcttc; P2:aagcaaatgaatta 
cacaatgcaagaggsttatttgttcagcaaatggcccatatgaatatcctccttag; P3:gactgccctgattctggta 
accataattttagtgagetttttttaccatggtgcaggctegagctgcttc; P4:gattaaggcagtaaattaagcage 
gettttcagctggctttacgctcatgcatatgaatatcctccttag; P5:CTCAAATGCCTGAGGTTT 
CAGcaggacttgcgtttggacgtc; Po;GAAAAGTTCTTCTCCTTTACTCATggtaaaaaa 
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acctcactaaaattatg; P7:;GACGTCCAAACGCAAGTCCTGctgaaacctcaggcatttgag; 
P8:CATGGATGAGCTCTACAAATAAgcgcaacgcaattaatgtaag; P9;CATAATTT 
TAGTGAGGTTTTTTTACCatgagtaaaggagaagaacttttc; P10:CTTACATTAATT 
GCGTTGCGCttatttgtagagctcatccatg; P11:CATTAGCGTTATAGTGAATCCGC 
taacgttctcaggscgeeste; P12:;GAAAAGTTCTTCTCCTTTACTCATgcgacctcccgtttt 
tecgcec; P13: CATTAGCGTTATAGTGAATCCGCtaaaacccatcgcttcagggc; P14:GA 
AAAGTTCTTCTCCTTTACTCATaatgaaacgctctcgtaagaatac; P15:CACCCCGCC 
CTGAGAACGTT Agcggattcactataacgctaatg; P16:GGCGGAAAAACGGGAGGT 
CGCatgagtaaaggagaagaacttttc; P17;GCCCTGAAGCGATGGGTTTTAgcggattcac 
tataacgctaatg; P18:GPTATTCTTACGAGAGCGTTTCATTatgagtaaaggagaagaactt 
ttc; ADRFN-ribB_RED forward: ttgcatcagtcctgaaatgttcaaccgtattcttacgagagcgtttcatta 
tgaatcagacgctactttc; PaRFN-ribB_RED forward: gtcgcgccggccatgctgcgcegcctgtgcg 
gcggaaaaacggeagetcgcatgaatcagacgctactttc, Ab/Pa RFN-ribB_RED reverse: agatccc 
ggtgcctaatgagtgagctaacttacattaattgcgttgcgcgctggctttacgctcatgtg; yqiC_RED for- 
ward: caggacttgcgtttggacgtcgaactcttcacggcttacaagetcgagececetcagctgcgcttgtage; 
ribB_RED reverse: gattaaggcagtaaattaagcagcgsttttcagctggctttacgctcatgtgcctgacg 
gtatgccacca; yqiC seq. reverse: agttcgctgattctttgttc; ribB seq. reverse 1: agcggaatt 
aacatcttgc; ribB seq. reverse 2: gcttcaatggtcacggtaa. Transition from capital to 
lowercase letters for P5-P18 denotes boundaries of fragments that facilitate 
in-fusion cloning. 

FMN-GFP reporter construction. ECFMN-GFP reporter plasmids were con- 
structed by fusing the EcFMN region, inclusive of 550 bp upstream of ribB 
through the start codon, to gfpuv and cloning into a vector with the low copy 
CloDF13 origin of replication. Primers P5 and P6 were used to amplify the 
EcFMN region from wild-type and resistant mutants by colony PCR, primers 
P7 and P8 were used to amplify the CloDF13 origin and Sm* cassette from pCDF- 
1b (EMD Millipore), and primers P9 and P10 were used to amplify gfpuv 
from pGFPuv (Clontech). Upon purification, all three linear PCR products were 
combined using the in-fusion HD cloning system (Clontech) and transformed 
into TOP10 cells (Life Technologies) with selection on spectinomycin (MP bio- 
medicals) to yield pCDF-EcFMN-GFP reporter plasmids. Plasmids were subse- 
quently transformed into the MB5746 ribocil* mutant M5 background for 
compound testing. 

Initial attempts to create Pseudomonas aeruginosa and Acinetobacter bauman- 

nii FMN-GFP reporters in a similar fashion to the ECFMN-GFP reporters by 
using the PaFMN or AbFMN region, inclusive of 550 bp upstream of ParibE or 
AbribB through the start codon, did not yield constructs with sufficient baseline 
fluorescence (data not shown). In order to optimize fluorescence, hybrid con- 
structs were made in which the E. coli promoter region was placed upstream of the 
PaFMN or AbFMN elements. Primer combinations P11/P12 or P13/P14 were 
used to amplify the PaFMN or AbFMN elements, respectively, from wild-type 
cells by colony PCR, and primer pairs P15/P16 and P17/P18 were used to amplify 
the E. coli promoter, gfpuv, and vector backbone from the previously constructed 
pCDF-EcFMN-GFP plasmid for combination with PaFMN and AbFMN, 
respectively. Purified linear PCR products were combined and transformed as 
described above to yield pCDF-EcPro-PaFMN-GFP and pCDF-EcPro-AbFMN- 
GFP reporter plasmids. Again, plasmids were transformed into the MB5746 
ribocil® mutant M5 background for compound testing. 
Construction of the hybrid ADFMN and PaFMN EcribB strains. The native, 
chromosomal E. coli ribB riboswitch was replaced with that of either A. bauman- 
nii or P. aeruginosa using a two-step A-RED recombineering process“. In the first 
recombineering event, the GFPuv coding sequence from either the pCDF-EcPro- 
AbFMN-GFP or pCDF-EcPro-PaFMN-GFP plasmid was replaced with the 
E. coli ribB coding sequence (EcribB). To this end, MB5746 ribB::kan cells were 
grown in CAMH broth supplemented with 4 yg ml’ riboflavin (reconstituted in 
1:1 dH2O:ethanol) and transformed (as described below) with the temperature- 
sensitive plasmid pKD46“. Reactions were plated onto CAMH agar +50 pg ml! 
ampicillin at 30°C. Next, either the pCDF-EcPro-AbFMN-GFP or pCDF- 
EcPro-PaFMN-GFP plasmid was transformed into MB5746 ribB::kan/pKD46 
and plated onto CAMH + 41g ml | riboflavin + 50 pgml~' spectinomycin + 
50 jg ml‘ ampicillin at 30 °C to maintain double plasmid selection. The result- 
ing strains were recombineered with EcribB PCR product containing flanking 
regions homologous to the cognate pCDF plasmid. Substrate PCR products were 
obtained through colony PCR of the wild-type ribB locus of MB5746 and 
either the AbDFMN-ribB_RED forward or PaFMN-ribB_RED forward primer 
in combination with the Ab/Pa FMN-ribB_RED reverse primer. Cells were recov- 
ered in CAMH broth for 1h and Rib* colonies were selected on CAMH agar 
+50,1gml~! spectinomycin and incubated at 37°C to remove pKD46. The 
resulting strains, MB5746 ribB::kan (pCDF-EcPro-AbFMN-EcribB) or (pCDF- 
EcPro-PaFMN-EcribB) carry a plasmid-borne EcribB gene downstream of the 
native E. coli ribB promoter fused to either ADFMN or PaFMN. 

In the second recombineering event, the plasmid-borne ADFMN- or PaFMN- 
EcribB fusions engineered above were introduced into the E. coli chromosome in 
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single-copy at the native ribB locus. MB5746 ribB::kan/pKD46 cells were grown 
to exponential phase in CAMH broth + 4 pg ml riboflavin + 50 pg ml’ ampi- 
cillin and electroporated with either AbDFMN-EcribB or PaFMN-EcribB PCR 
product containing flanking regions homologous to the native ribB locus. 
These PCR products were amplified from pCDF-EcPro-AbFMN-EcribB or 
pCDF-EcPro-PaFMN-EcribB using the yqiC_RED forward and ribB_RED 
reverse primers. Unlike strains carrying the plasmid-borne hybrid EcribB con- 
structs, the chromosomal hybrid fusion constructs do not yield enough riboflavin 
for optimal growth on CAMH in single copy. Therefore, reactions were recovered 
in CAMH broth containing very low levels of riboflavin (0.4 pg ml *, a concen- 
tration that does not permit growth of the ribB deletion mutant), plated 
onto CAMH agar + 0.4pg ml! riboflavin, and incubated at 37°C to remove 
the pKD46 plasmid. The E. coli ribB promoter, hybrid riboswitch, and EcribB 
coding regions were sequenced in resulting Rib* cells and additionally sequenced 
at joint sequences using yqiC seq. reverse, ribB seq. reverse 1, and ribB seq. reverse 
2 primers. 

All transformations were electroporation reactions performed as suggested” 
with some modifications. Around 30-50 ml of cells were grown in CAMH to 
exponential phase. For recombineering reactions, strains harbouring pKD46 
were induced for 1h with 1% arabinose before harvesting of cells. Cells were 
washed with 30 ml of ice-cold ddH2O and pelleted at 4°C, 3,000g for 10 min, 
followed by two additional washes with 1 ml ice-cold ddH,O and pelleted each 
time at 8,000g at 4 °C for 2 min. Pellets were re-suspended in 300 pl ddH,O and 
100 jl cells were incubated with 1-2 ul PCR product for 5 min before electro- 
poration. Electroporation reactions were performed using 0.1-cm gap cuvettes 
and a GenePulser II (BioRad) with settings at 200 Q, 25 uF, and 1.8 kV. Cells were 
recovered at the appropriate temperature in 1 ml CAMH brothas described above 
and plated on CAMH agar containing the appropriate supplements. All PCR 
reactions were performed using GoTaq Green Master Mix (Promega 
Corporation) according to manufacturer’s instructions. DNA sequencing was 
performed by Genewiz, Inc. 

Crystallization and RNA-ribocil X-ray co-structure determination. Crystals of 
the F. nucleatum FMN riboswitch in presence of the ligand were obtained fol- 
lowing published protocols'*° with minor modifications. The RNA was synthe- 
sized in two strands: GGAUCUUCGGGGCAGGGUGAAAUUCCCGACCGG 
UGGUAUAGUCCACGAAAGCUU and GCUUUGAUUUGGUGAAAUUCC 
AAAACCGACAGUAGAGUCUGGAUGAGAGAAGAUUC. The oligonucleo- 
tides were purchased from Sigma-Aldrich. After reception each strand was dis- 
solved in water, aliquoted so that each aliquot would contain the material 
necessary to make a 25 pl solution at 0.4mM concentration. The aliquots were 
lyophylized using a Centrivap concentrator (Labconco) and kept at —20 °C for 
long-term storage. Prior to annealing the nucleic acids were re-suspended in 25 pl 
annealing buffer (10 mM cacodylic acid, 100 mM acetate, 4 mM MgCl, adjusted 
to pH 6.8 using KOH). The oligomers were mixed together, along with 1.0 pl ofan 
inhibitor stock solution at 50 mM in 100% deuterated DMSO, and annealed in a 
thermocycler by incubation at 37 °C for 30 min followed by cooling from 37 °C to 
4 °C ata rate of 3 °C per min. The crystals were grown by vapour diffusion using a 
15-well EasyXtal DropGuard X-Seal tool (Qiagen) after mixing 3 pl of ribos- 
witch-ligand solution with 3 ul precipitant (0.1M Na acetate, pH 5.0, 0.2M 
MgCh, and 7 to 11% v/v PEG 4K). Small nuclei appear after a few days, and 
are made to grow larger for diffraction studies by controlled drying. Drying is 
achieved by substitution once a day of the adequate volume of well solution with a 
50% v/v PEG 4K stock solution. The volume is calculated to achieve a ~2% 
increase in precipitant concentration per step. The crystals after growth are 
harvested and dipped for 1 to 2 min in a cryoprotectant solution (0.1M Na 
MES, pH 6.5, 0.2 M MgCl, 10% v/v PEG 4K, 20% v/v glycerol, and ligand diluted 
to 1mM concentration). Crystals were harvested with a mesh Litholoop 
(Molecular Dimensions Ltd) and flash-frozen in liquid nitrogen. 

X-ray crystal structure determination. X-ray diffraction data (Extended Data 
Table 2) were collected at the Advanced Photon Source (APS) sector 17 (IMCA) 
at 1.0 A wavelength using a Pilatus 6M (Dectris) pixel array detector. 720 frames 
with an oscillation of 0.25° each were collected. The data were processed using the 
automated pipeline autoPROC”, with calls to the programs XDS” for integration 
and AIMLESS* for scaling. The structure was determined using PDB entry 3F4E 
as a starting point after removing all heterogeneous atoms including the FMN. 
The structure was refined without inclusion of the ligand coordinates at any step 
before and until an omit map difference map is generated to fit the compound. 
The steps include refinement using the program autoBUSTER”, corrections of 
the model and inclusions of several cations with Coot, and Cartesian simulated 
annealing using the program Phenix” to further eliminate the potential of 
bias against FMN which was present in the PDB 3F4E entry. The set of ‘free’ 
reflections was taken from the same PDB entry 3F4E and completed as required. 
All refinement calculations after adding the ligand were performed using the 
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program autoBUSTER”. Model visualization and rebuilding was performed 
using the program Coot”. All figures in the manuscript generated with PyMol”. 
Compound chirality. Co-crystallization of the heptamer with ribocil was per- 
formed using a racemic mixture of the ligand. In spite of its limited resolution, 
2.9 A, the (R) isomer fits distinctly better than the (S) isomer in the initial electron 
density. Consistent with this observation, crystallographic refinement of a model 
which starts with the wrong (R) isomer ends up with a structure with the chiral 
centre inversed and nearly planar, an impossible stereochemistry. By contrast, the 
chiral volume remains unchanged in the course of refinement when starting from 
the (S) isomer. Notwithstanding the electron density map, due to the constrained 
nature of the binding site it is not possible to fit the (R) isomer while maintaining 
reasonable ligand stereochemistry and parallelism of the pyrimidynonyl and the 
methylaminopyrimidynyl between the bases of A48 and A85, and against the base 
of G62. Altogether, crystallographic and stereochemical considerations strongly 
support the conclusion that only the (S) form binds to the FMN aptamer. Further 
observations made later when the isomers were separated agree with this inter- 
pretation: only one of them is active against the riboswitch, and the ligand with 
the correct chirality is more active than the racemic mixture (Extended Data Fig. 3 
and Extended Data Table 1). 

Homology modelling. A homology model of the E. coli FMN aptamer was 
constructed using program mutate_bases* of the 3DNA package using the 
F. nucleatum impX riboswitch aptamer X-ray structure as the template and the 
FMN aptamer alignment of E. coli, F. nucleatum, P. aeruginosa and A. baumannii 
(Extended Data Fig. 5). All nucleotide insertions in the E. coli sequence were 
removed in the model (Extended Data Fig. 5). There are 34 base changes among 
the 111 nucleotides modelled. Base pairing when present remains consistent. 
Energy minimization at A92 was performed to avoid VDW clashes using 
Macromodel (Schrodinger, LLC). 

FMN-GFP reporter assays. Reporter strains were diluted to ~5 X 10° CFU ml! 
in CAMHB supplemented with 30p1gml~' spectinomycin. Compounds to 
be tested were serially diluted twofold through 11 points in 100% DMSO. 
A BioMek FX liquid handler was used to deliver 49 1 of diluted culture into a 
384-well, black/clear-bottom assay plate followed by 1 il of titrated compound. 
DMSO and antibiotic controls were added manually to appropriate wells and the 
plates were shaken for 1 min before incubating at 37 °C. After overnight growth, 
fluorescence, using 405 nm excitation and 510 nm emission, and absorbance at 
600 nm, was assessed on an EnVision multiplate reader (Perkin Elmer). 
Fluorescence response (RFU), relative to full growth and fully inhibited (50 hM 
ribocil) controls and absorbance response (OD¢oo), relative to full growth and 
sterile controls, were fitted to four parameter (variable slope) curves. The con- 
centration of compound which decreased the specific fluorescence signal by 50% 
is reported as the GFP ECsp. 

FEMN binding to riboswitch aptamer. Aptamers were first re-annealed at a 
20M concentration in 4mM KH>2PO,, 16mM Kj,HPO,, 64mM KCl and 
0.1 mM EDTA, pH 7.4 buffer by heating at 95 °C for 5 min followed by incubation 
at room temperature for 15 min. Only one re-annealing cycle was performed per 
aptamer sample. A 1.25-fold serial dilution of the re-annealed aptamer was pre- 
pared to have a final concentration ranging from 6.6 to 150nM in 50 mM Tris- 
HCl, 100mM KCl and 2mM MgCl, assay buffer, pH 7.4. This was mixed with 
FMN ligand (30-240 nM final concentration). Fluorescence signal was read using 
the Spectramax M5 at an excitation wavelength of 455 nm and emission wave- 
length at 525 nm with cut-off filter at 515 nm. The instrument was set up in kinetic 
mode to acquire data every 20s. The steady-state Ky and the binding-competent 
fraction of aptamer were determined from fluorescence data obtained at 120 min 


of the reaction by fitting to a quadratic equation fully describing the binding 
equilibrium under tight-binding conditions. 

Competition binding kinetics. A twofold serial dilution of compounds was 
prepared to have a final assay concentration range from 1.22 to 10,000nM. 
This was prepared in 50mM Tris-HCl, 100mM KCl and 2mM MgCl, assay 
buffer, pH 7.4, with 0.2% DMSO. FMN ligand concentration was 60nM and 
the E. coli FMN aptamer concentration was 48 nM or, for ribocil, 150 nM. The 
fluorescence signal was read on the Spectramax M5 as described above. The 
steady-state binding competition data at 120 min was fitted to a cubic equation 
fully describing the competition binding equilibria to derive the K; value for the 
compound, while fixing K, to the value obtained earlier for FMN binding. The 
binding kinetics data was fitted by KinTek Explorer-based numerical integration 
with K, constrained to derive the dissociation rate constant (k,¢,). The association 
(kon) rate constant is then calculated from K; and kogr. 

Mammalian cytotoxicity assay. The Click-iT EdU Alexa Fluor 488 HCS assay kit 
(Life Technologies, C10351) was used to assess the potential cytoxicity of ribocil 
in mammalian cell cultures using a modified version of the manufacturer’s pro- 
tocol. For the assay, mycoplasma-tested HeLa cells (ATTC) were seeded at 4,000 
cells per well in 384-well poly-p-lysine-coated plates (Greiner, 781946) in 25 ll of 
culture medium (Optimem I, Life Technologies) and treated with a 20-point 
twofold dilution series of ribocil. After addition of EdU (5 uM) and incubation 
(37 °C) for 24h, images were captured and analysed using an Acumen eXC3 (TTP 
Labtech Ltd) laser scanning cytometer. Total cell numbers were determined using 
Hoechst 33342 (Life Tech, H3530). Ribocil displayed no HeLa cell cytoxicity as 
detected by cell count (ECs9 = 100 um, activity at 100 1M = 17%) or EdU mea- 
surement (ECs9 = 100 uM, activity at 100 uM = 11%). 
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Extended Data Figure 1 | Enzymatic steps responsible for riboflavin, FMN, 
and FAD biosynthesis in E. coli. a, Enzyme names are shown above reaction 
arrows. RibA is a GTP cyclohydrolase II, RibB is a (3S)-3,4-dihydroxy-2- 
butanone 4-phasphate synthase. RibDG is a bifunctional enzyme encoding 
2,5-diamino-6-ribosylamino-4(3H)-pyrimidinone 5'-phosphate deaminase 
and 5-amino-6-ribosylamino-2,4(1H,3H)-pyrimidinedione 5’-phosphate 
reductase activity. RibH is a lumazine synthase. RibE is a riboflavin synthase. 
RibFC is another bifunctional enzyme encoding flaviokinase and FAD 
synthetase activity. Note, one molecule of GTP and two molecules of 
ribulose 5-phosphate are required to make one molecule of riboflavin. 

I, 2,5-diamino-6-ribosylamino-4(3H)-pyrimidinone 5’ phosphate; II, 
5-amino-6-ribosylamino-2,4(1H,3H)-pyrimidinedione 5’-phosphate; III, 
5-amino-6-ribitylamino-2,4(1H,3H)-pyrimidinedione 5’-phosphate; IV, 
5-amino-6-ribitylamino-2,4(1H,3H)-pyrimidinedione; V, (3S)-3,4-dihydroxy- 
2-butanone 4-phosphate; VI, 6,6-dimethyl-8-D-ribityllumazine. Note two 
molecules of VI dismutate (step 7) to give IV and riboflavin. Note also that the 
phosphatase converting III to IV (step 4) is not known. Adapted from 
Pedrolli et al.”*. b, HPLC-based quantitative analysis of riboflavin, FMN, and 
FAD levels in E. coli strain MB5746 following genetic inactivation of riboflavin 


biosynthesis (AribA or AribB) or ribocil drug treatment. Wild-type strain 
MB5746 and isogenic AribA or AribB strains were grown overnight in CAMHB 
media containing 10 11M riboflavin, washed in CAMHB media lacking 
riboflavin, diluted (1:50) and grown for an additional 20 h before harvesting 
cells and HPLC analysis of cell lysates as described in the Methods. In parallel, 
overnight cultures of MB5746 grown in CAMHB were diluted (1:50) and 
treated with DMSO as a control or 20 uM ribocil, grown for an additional 20 h 
and analysed as above (middle panel). Data are presented as the mean of two 
technical repeats and are representative of two independent experiments. 
Riboflavin, FMN and FAD depletion levels are listed as a percentage relative to 
wild-type controls (right panel). c, HPLC-based quantitative analysis of 
riboflavin levels in 19 independent E. coli ribocil® mutant isolates versus the 
isogenic parent strain, MB5746. Overnight cultures of each strain were 
diluted 1:50 in CAMHB and treated with 10 11M ribocil or mock treated 

(1% DMSO). After growth at 37 °C with agitation for 20h, cell lysates were 
prepared and riboflavin levels were quantitated as described in the Methods. 
Data are the mean of two technical repeats (error bars indicate range) and is 
representative of two independent experiments. 
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Extended Data Figure 2 | E. coli FMN riboswitch-regulated reporter gene 
expression and ribocil-mediated inhibition and isolation of ribocil™ 
mutations mapping to A. baumannii and P. aeruginosa FMN riboswitches. 
a, GFP reporter constructs under the control of the intact E. coli ribB promoter 
and FMN riboswitch (EcPro-EcCFMN-GFP), or replaced with the A. baumannii 
FMN riboswitch (EcPro-AbFMN-GFP) or P. aeruginosa FMN riboswitch 
(EcPro-PaEMN-GEP) are maintained in E. coli ribocil® mutant, M5 (Fig. le). 
Note, E. coli ribB upstream promoter sequence (EcPro) was fused to 

A. baumannii and P. aeruginosa FMN elements (ADFMN and PaFMN, 
respectively) to facilitate sufficient baseline expression in E. coli MB5746 host 
cells. b, Introduction of FMN riboswitch reporter plasmids into a ribocil® 
mutant strain background (M5) enables ribocil dose-dependent inhibition of 
GFP expression without inhibiting cell growth. Dose-response curve for 16 
technical repeats of ribocil-mediated inhibition of EcPro-EcCFMN-GFP 


expression is shown. c, Tabulated ECs values (+s.d.) of ribocil required to 
inhibit EcPro-EcCFMN-GFP, EcPro-AbFMN-GFP and EcPro-PaFMN-GFP 
expression. Data for EcPro-EcFMN-GFP are from eight independent 
experiments each with two technical repeats, whereas data for EcPro-AbDFMN- 
GEP and EcPro-PaFMN-GFP are from two independent experiments each 
with four technical repeats. d, Schematic summary of E. coli recombineered 
ribB locus, EcPro-AbFMN-ribB, in which the endogenous FMN riboswitch is 
replaced with the A. baumannii FMN riboswitch. Below, predicted secondary 
structure of the A. baumannii FMN riboswitch and ribocil® mutations 
highlighted in red. e, Schematic summary of E. coli recombineered ribB locus, 
EcPro-PaFMN-ribB, in which the E. coli FMN riboswitch is replaced by the 
P. aeruginosa FMN riboswitch. Below, predicted secondary structure of the 
P. aeruginosa FMN riboswitch and ribocil® mutations highlighted in red. 
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Extended Data Figure 3 | Ribocil mutant analysis; Ribocil:FMN 
competition binding and Ribocil-analogue binding studies. a, Wild-type 
FMN and mutant constructs responsive to ribocil are displayed with a curve fit. 
Fluorescence was measured using 405 nm excitation and 510 nm emission 
and data represent the mean of two independent experiments (error bars 
indicate range). EC;9, the maximum cell-density-adjusted fluorescence signal 
observed with no ribocil addition, and the maximum per cent inhibition of 
fluorescence for each construct is listed in the accompanying table. b, Wild-type 
RNA aptamer samples were separated on 1.2% agarose gels and visualized 
by ethidium bromide staining. c, Binding affinity for ribocil. Binding affinity to 
the E. coli FMN riboswitch aptamer is determined from ribocil dose-dependent 
competition against a fixed concentration of FMN (60 nM) and a fixed 
concentration of E. coli FMN riboswitch aptamer (150 nM). Shown is a 
representative example of ribocil competition data. Mean affinity (+s.d.) from 
four independent experiments for ribocil, ribocil-A, ribocil-B and ribocil-C 

is reported in the table (panel e of this figure). d, Chemical structure of 
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ribocil-A, ribocil-B and ribocil-C. e, Steady-state Kg, binding kinetics, riboflavin 
biosynthesis inhibition and minimum inhibitory concentration (MIC) in 

E. coli MB5746 for ribocil-A, ribocil-B and ribocil-C. Binding kinetics were 
determined using a competition method employing a fixed amount of FAN 
(60 nM) and E. coli FMN riboswitch (48 nM). FMN binding affinity and 
kinetics are determined by fluorescence quenching experiments (see Methods). 
Given the tight-binding conditions, both FMN and ribocil affinity values 
represent an upper limit. As a result, the ko¢ values may also be an upper limit, 
and/or the k,, values may be a lower limit. Inhibition of riboflavin biosynthesis 
after treatment of E. coli MB5746 with ribocil, ribocil-A, ribocil-B and ribocil-C 
for 20h as described in Methods. Data for ribocil are the mean (+s.d.) of 
three independent experiments, for ribocil-B the mean (+s.d.) of four 
independent experiments and for ribocil-C the mean (error bars indicate range) 
of two independent experiments. MIC of ribocil, ribocil-A, ribocil-B and 
ribocil-C against E. coli MB5746 determined by the broth microdilution 
method. 
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Extended Data Figure 4 | Electron density omit map of F. nucleatum impX 
FEMN aptamer-ribocil co-crystal structure. a, The electron density difference 
map (see Methods for details regarding its calculation) is shown as a grid 
contoured at 3.00 level. The refined structure of the ligand is shown as sticks, 
methyls in slate blue, the aptamer as lines with the methyl coloured in 
orange, cations as spheres coloured in purple. b, Same figure as a but rotated 
to bring the electron density around the planar 6-thiophenyl-pyrimidonyl 
horizontal and perpendicular to the figure plane. The best fit of the (S)-isomer 
which maintains the pyrimidiny] in its electron density, stacking against G62, 


and inserts the 6-thiophenyl-pyrimidonyl between A48 and A85 is shown (see 
Methods). The 6-thiophenyl-pyrimidony] is clearly slanted compared to the 
density and the planes of the A48 or A85 bases. c, Superposition of the X-ray 
co-structures of the F. nucleatum riboswitch aptamer with either FMN 

(PDB entry 3F2Q) or ribocil. After superposition using the phosphorus atoms 
of the bases in the immediate vicinity of the ligand, the RNA is represented 
as lines in cyan and orange for the co-structures with FMN and ribocil, 
respectively, and as sticks for the ligand, in cyan and slate blue, respectively. The 
number for the key bases interacting with either FMN or ribocil is indicated. 
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Extended Data Figure 5 | Homology model of the predicted ribocil binding 
site within the E. coliFMN aptamer. a, The homology model was constructed 
using program mutate_bases of the 3DNA package* using the F. nucleatum 
impX riboswitch aptamer X-ray structure as the template and the FMN 
aptamer alignment of E. coli, F. nucleatum, P. aeruginosa and A. baumannii. 
Only two bases differ (in red labels) compared to F. nucleatum FMN riboswitch, 
G66A and A95U. b, A full-length homology model of the E. coli FMN 
aptamer and mapping of ribocil® mutations. Location of the ribocil® mutants 
C33U, G37U, G93U, C100U, C111U and A94-102 are mapped on the E. coli 
homology model of the ribocil-bound E. coli FMN riboswitch aptamer. In 
the model, all nucleotide insertions in the E. coli sequence not found in 


F, nucleatum were not modelled in the resulting crystal structure. There are 
34 base changes among the 111 nucleotide modelled. A119 is removed for 
clarity. Nucleotides C33, G37, G93, C100 and C111 are coloured (green) and 
bases deleted in A94-102 are highlighted (red). G96 (red) which makes direct 
contact with ribocil (blue) in the wild-type aptamer is deleted in A94—102. 

c, Alignment of bacterial riboswitch sequences from RFAM™. E. coli aptamer 
nucleotide G96, which is equivalent to nucleotide G62 in the F. nucleatum 
aptamer, is indicated with an asterisk. Black shading represents identical and 
grey is similar using default consensus settings with the BOXCHADE program 
(http://sourceforge.net/projects/boxshade/). 
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Extended Data Figure 6 | Ribocil, roseoflavin and ribocil-C cross-resistance 
to E. coli ribocil® mutants. a, 5 il of ribocil (1.3 mM), roseoflavin (31 mM) 
and the negative control, novobiocin (2.5 mM), were spotted (twofold 
dilution series) on the surface of a CAMH plate (a) or CAMH plate plus 20 uM 
riboflavin (b) and seeded with E. coli MB5746. c-i, Same as a but plates seeded 
with the indicated ribocil® mutants. Note, whereas the growth-inhibiting 
activity of ribocil is completely suppressed by riboflavin, roseoflavin activity is 
only partially suppressed and ribocil® mutants are cross-resistant to roseoflavin 
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and display similar level of suppression as riboflavin supplementation. 

The figure is representative of two independent experiments. j, Ribocil-C 
inhibits the FMN riboswitch and is cross-resistant to ribocil® mutations. 5 pl of 
ribocil (1.3 mM), ribocil-C (153 |1M) and the negative control, novobiocin 
(2.5 mM), were spotted (twofold dilution series) on the surface of CAMH plates 
seeded with the above described strains and/or riboflavin supplement as shown 
in a-i. The figure is representative of three independent experiments. RF, 
riboflavin. 
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Compound _ [Starting] Location 
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Extended Data Figure 7 | Bioactivity of riboflavin analogues are not but supplemented with 20 1M riboflavin. Chemical structures are shown. The 
suppressed by riboflavin supplements. a, Left plates; 5 jl of ribocil (1.3mM), figure is representative of two independent experiments. b, Inhibition of 
roseoflavin (31 mM), C002 (2 mM), C003 (2 mM), C004 (2 mM), C005 flavin synthesis by COO6. Flavin levels were determined by HPLC after C006 
(2 mM), C006 (2 mM), C007 (2 mM), C008 (2 mM), and the negative control, treatment of MB5746. All data are from a single 11-point dose titration and 
Chiron 90 (9 1M), were spotted (twofold dilution series) on the surface of a is representative of two independent experiments. 


CAMH plate seeded with E. coli MB5746. Right plates; same as left plates 
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6 RK 


Log CFU/g Spleen +/- SEM 


Vehicle 30.0mpk 60.0mpk 120.0mpk 0.5 mpk 


Ribocil-C — Ribocil-C — Ribocil-C Ciprofloxacin 


fl 9.75 log CFU/g spleen 
8 9.66 log CFU/g spleen 
§) 8.31 log CFU/g spleen 


I 7.65 log CFU/g spleen 
[_] 5.51 log CFU/g spleen 


Extended Data Figure 8 | In vivo activity of ribocil-C in a murine systemic 
infection model of E. coli. a, DBA/2J mice were infected i-p. with E. coli 
strain MB5746 (5.0 X 10° CFU per mouse, a tenfold higher inoculum than that 
used in Fig. 4) and treated by subcutaneous injection with ribocil-C or 
ciprofloxacin at the indicated doses (mg kg” ') three times over a 24h infection 
period. Spleens were aseptically collected from five mice per group and the 
reduction of log[CFU per g spleen tissue] was calculated on the basis of bacterial 
burden in spleens of the vehicle-treated (10% DMSO) control group. Data 
represents the average CFU per g spleen + s.e.m. One-way ANOVA with 
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Log CFU/g Spleen +/- SEM 


Vehicle 0.5 mpk 


30.0 mpk 
Ribocil-C — Ribocil-C — Ribocil-C Ciprofloxacin 


60.0 mpk 120.0 mpk 


® 9.75 log CFU/g spleen 
® 9.66 log CFU/g spleen 
® 8.31 log CFU/g spleen 


® 7.65 log CFU/g spleen 
O 5.51 log CFU/g spleen 


Dunnett’s multiple comparison test demonstrates statistically significant 

(*P < 0.05, ***P < 0.001) log[CFU per g spleen] reductions in the 60 and 
120mg kg ' ribocil-C treatment groups and ciprofloxacin control. b, The CFU 
data from each mouse plotted as individual points, solid bar and error bars 
represent the average CFU per g spleen +s.e.m. One-way ANOVA with 
Dunnett’s multiple comparison test demonstrates statistically significant 

(*P < 0.05, ***P < 0.001) log[CFU per g spleen] reductions in the 60 and 
120 mg kg | ribocil-C treatment groups and ciprofloxacin control. 
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Extended Data Table 1 | Frequency of resistance (FOR) determination and microbiological activity summary 


a 
MIC for Ribocil 2 pg/mL 
8X MIC = 16 ug/mL 
1x107 CFU/plate, 3 plates/condition 
FOR 
CAMH Media Minimal Media 
MB5746 2.6x10° 3.4x10 
MB5746-Ab-RFN 3.3x10% <3.3x108 
MB5746-Pa-RFN 6.4x107 ND 
b 
E. coli MB5746/ E. coli MB5746/ 
Escherichia coli Pseudomonas aeruginosa RFN ribB Acinetobacter baumannii RFN ribB 
Highest test MB5746 
conc. MB5008 MB5747 IpxC- Pa-2 Pa-17 Ab-14 Ab-17 
Compound (ug/mL) IpxC- tolC::tn10 tolC::tn10 Pa-WT (-193 mut) — (-317 mut) Ab-WT (-57 mut) (-9 mut) 
Ribocil 64 16 2 2 16 >64 >64 2 >64 >64 
Roseoflavin 128 >128 >128 >128 >128 >128 4 >128 >128 
Ribocil-A 128 >128 >128 >128 >128 >128 64 >128 >128 
Ribocil-B 128 8 1 1 >128 >128 1 128 >128 
Ribocil-C 64 2 0.25 0.25 >64 >64 <0.063 4 >64 
Penicillin G 64 >64 >64 >64 >64 >64 >64 >64 >64 
Novobiocin 64 4 0.5 0.5 0.25 0.25 0.25 0.25 0.5 
Chloramphenicol 64 1 0.5 0.5 : 0.5 0.5 0.5 0.5 0.5 
Nalidixic acid 64 1 0.5 0.5 : 0.5 0.5 0.5 
c 
Highest test Pseudomonas aeruginosa Acinetobacter baumannii 
conc. ATCC35151 MB5890 ATCC12457 ATCC19606 
Compound (ug/mL) Z61 Multi-efflux AlpxA AlpxC 
Ribocil 128 >128 >128 >128 >128 
Ribocil-C 64 >64 >64 >64 >64 
Roseoflavin 128 >128 >128 >128 >128 
Penicillin G 64 <0.063 >64 >64 32 
Novobiocin 64 0.5 64 8 4 
Chloramphenicol 64 0.5 0.5 8 8 
Nalidixic acid 64 1 0.5 0.5 0.25 


d 


Highest test Saccharomyces cerevisiae Candida albicans 
conc. CAF2-1 CDR1/CDR1 DSY448 Acdr1/Acdr1 DSY654 Acdr1/Acdr1 

Compound (ug/mL) S$288c Sc Apars CDR2/CDR2 CDR2/CDR2 Acdr2/Acdr2 
Ribocil 32 >32 >32 >32 >32 >32 
Ribocil-C 32 >32 >32 >32 >32 >32 
Roseoflavin 32 >32 >32 >32 >32 >32 
Fluconazole 16 4 0.5 >16 (0.5) 1 (0.25) 1 (0.25) 
Amphotericin B 16 1 1 1 1 
Caspofungin 4 0.5 (0.25) 0.5 (0.25) 0.125 0.125 0.125 


a, Frequency of resistance to ribocil by E. coli strain MB5746 carrying A. baumannii, P. aeruginosa or native RFN. Testing was performed with both Mueller Hinton (CAMH) and M9 (Minimal Media) broth. 

b, Antibacterial activity of ribocil analogues and standard controls tested against: (1) efflux- and permeability-deficient E. coli MB5746 and two progenitor strains; (2) E. coli strain MB5746 carrying wild-type 
and mutant alleles of A. baumannii and P. aeruginosa RFN. Minimum inhibitory concentrations (MICs) were determined by CLSI standard broth microdilution. ¢, Antibacterial activity of ribocil analogues and 
standard controls tested against sensitized P. aeruginosa and A. baumannii isolates. MICs were determined by CLSI standard broth microdilution. d, Antifungal activity of ribocil analogues and standard controls 
tested against wild-type and efflux-deficient Saccharomyces cerevisiae and Candida albicans. MICs were determined by CLSI standard broth microdilution. All MIC values are expressed as pg ml}. 
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Extended Data Table 2 | X-ray crystal data collection and refinement 


FMN riboswitch-ribocil 


Data collection 


Space group P3,21 
Cell dimensions 
a, b, c (A) 71.1,71.1,138.5 
a, B, y (°) 90.0, 90.0, 120.0 
Resolution (A) 2.93 (3.27-2.93)* 
Rmerge 3.8% (39.0%) 
Vol 36.2 (6.2) 
Completeness (%) 99.9% (100%) 
Redundancy 9.5(9.8) 
Refinement 
Resolution (A) 35.6 — 2.93 


No. reflections 
Rwork/Riree 
No. non-H atoms 
RNA 
Ligand/ion 
Water 
B-factors 
RNA 
Ligand/ion 
Water 
R.m.s deviations 
Bond lengths (A) 
Bond angles (°) 
Coord. error from Luzzati plot (A) 
Molprobity score 


Work 8,737 free 442 
21.3%/23.4% 


2,340 
27/8 
0 


103.3 
85.8/113.7 
N/A 


0.0098 

0.74 

0.452 

2.33 (97th percentile) 
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A full data set was collected with one crystal. Coordinate error estimated from Luzzati plot using 
SFCHECK®®. 
*Highest resolution shell is shown in parenthesis. 
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The composition of the neutral gas comas of most comets is domi- 
nated by H,O, CO and CO), typically comprising as much as 95 per 
cent of the total gas density’. In addition, cometary comas have 
been found to contain a rich array of other molecules, including 
sulfuric compounds and complex hydrocarbons. Molecular oxygen 
(O2), however, despite its detection on other icy bodies such as the 
moons of Jupiter and Saturn’, has remained undetected in comet- 
ary comas. Here we report in situ measurement of O, in the coma 
of comet 67P/Churyumoy-Gerasimenko, with local abundances 
ranging from one per cent to ten per cent relative to H,O and with 
a mean value of 3.80 + 0.85 per cent. Our observations indicate 
that the O,/H,0 ratio is isotropic in the coma and does not change 
systematically with heliocentric distance. This suggests that prim- 
ordial O2 was incorporated into the nucleus during the comet’s 
formation, which is unexpected given the low upper limits from 
remote sensing observations*. Current Solar System formation 
models do not predict conditions that would allow this to occur. 

Measurements of the coma of 67P/Churyumov-Gerasimenko 
(hereafter 67P) were made between September 2014 and March 
2015 with the ROSINA-DFMS mass spectrometer® on board the 
Rosetta spacecraft. For the present study, we analysed 3,193 mass 
spectra taken in this time period. Because of the high mass resolving 
power and sensitivity of ROSINA-DFMS, it was possible to differenti- 
ate unambiguously between the three main species present in the 
narrow mass range centred at mass-to-charge (m/z) ratio 32 Da/e, 
namely, molecular oxygen (O2), atomic sulfur (S) and methanol 
(CH30H); such differentiation has not been achieved by previous in 
situ or remote sensing measurements at comets. Figure 1 shows several 
DFMS measurements centred at the O, peak. The green and orange 
lines show data taken before the close encounter with 67P. Only minor 
signatures from the tenuous neutral gas atmosphere of the Rosetta 
spacecraft can be identified, and even after long thruster firing man- 
oeuvres, which use N2O, as an oxidizer, the contamination of the Oz 
signal remains small (see the green line in Fig. 1). Measurements taken 
while orbiting 67P, shown as the light blue, dark blue and purple lines 
in Fig. 1, show a clear increase of the O> signal, indicative of cometary 
O,. These three measurements were taken at decreasing distances (r) 
from the comet nucleus, and follow the predicted 1/r’ signal depend- 
ence that is expected for conserved cometary species, giving further 
confidence in our detection. 

As previously reported, the local number densities in the coma vary 
spatially and temporally®’ for different compounds. Figure 2 shows 


correlation plots of H,O with O2, CO and N>. Red symbols represent 
data from 17 to 23 October 2014, for which previously published data 
on Nz are available. During this time the spacecraft was in closed orbits 
at 10km from the nucleus centre. Black symbols are data from 1 
September 2014 to 31 March 2015. For both periods, O, clearly shows 
the strongest correlation with H,O. While CO shows a high correlation 
with H,O from 17 to 23 October 2014, the correlation for the whole 
data set is fairly low. N2 shows the weakest correlation with H2O of all 
three species. The strong correlation between H,O and OQ), with a 
Pearson correlation coefficient of 0.88 (and even 0.97 for the 
October data), indicates that they are of similar origin in the nucleus 
and that their release mechanisms are linked, in contrast to CO and 
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Figure 1 | DFMS mass spectra around 32 Da/e normalized to the spectrum 
with the largest signal. The black labels indicate the three major species 
found in the coma of 67P at 32 Da/e. The green labels and green line identify 
contamination peaks from thruster firings, showing that their contributions to 
the O, peak are very low. The light blue, dark blue and purple lines represent 
measurements taken at different distances from the comet nucleus. 
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Figure 2 | Correlation between H,O and O,, CO and N;. a, HO and O,; 
b, H,O and CO; c, H,O and N>. All three panels share a common y axis. 
Numbers on x and y axes are proportional to number density but in 
arbitrary units. Red crosses mark a subset of data for which N, data are also 
available. Panel a shows the strong correlation between H,O and O,, which is 


N> which have a similar volatility but do not show a strong correlation 
with H,O (see Fig. 2 for correlation coefficient values). The O,/H,O 
ratio decreases for high H,O abundances, which might be caused by 
surface water ice produced by a cyclic sublimation-condensation pro- 
cess®, although the total amount of surface ice is limited’. 

A plausible mechanism for the strong O2/H2O correlation would be 
the production of O, by radiolysis or photolysis of water ice. Here we 
follow the convention that photolysis refers to ultraviolet photons 
breaking bonds, whereas radiolysis refers to more energetic photons 
or fast electrons and ions depositing energy into the ice and ionizing 
molecules. Creation of sputtered O, by radiolysis has been demon- 
strated in laboratory experiments’® and is observed for the icy moons 
of Jupiter—Europa, Ganymede and Callisto''’’—as well as for the 
rings of Saturn’. Comets are subject to radiolysis over various time- 
scales: (a) over billions of years, while they reside in the Kuiper belt; (b) 
over the period of a few years once they enter the inner Solar System; 
and (c) on very short timescales, as for the present radiolysis. In the 
Kuiper belt, the skin depth for producing O) is in the range of metres, 
although the produced O, may diffuse deeper into the porous nucleus. 
Once a comet begins its residence in the inner Solar System, it loses its 
surface material to a depth of several metres during each orbit around 
the Sun, therefore we can safely assume that no O) from radiolysis in 
the Kuiper belt phase remains in 67P at the percentage level. Radiolysis 
and photolysis by solar wind and ultraviolet radiation in the inner 
Solar System only affect the top few micrometres of the cometary 
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observed for all data. In contrast, the correlation of CO with H,O (b) varies 
over time, which leads to a low overall correlation between those two species. 


N; has the lowest correlation with HO of the compared species for the 
October data (c). 


surface. Taking account of 67P’s continuous mass loss through out- 
gassing, we estimate the actively outgassing surface areas to be lost toa 
depth of several centimetres over the time from August 2014 to March 
2015. If recent production by radiolysis or photolysis (only affecting 
the top few micrometres) were the source of the measured Os, our data 
would show a continuous decrease of the O2/H,0O ratio over the exam- 
ined time period as the active surface continues to be shed over that 
time. Apart from the variations related to H,O abundance, Fig. 3 
shows that we do not observe a systematic change in the O2/H,0O ratio 
over several months. Instantaneous creation of the measured O, by 
radiolysis or photolysis seems, overall, unlikely, and would lead to 
variable O, ratios due to different illumination conditions. Given that 
radiolysis and photolysis, on any of the discussed timescales, do not 
seem to be plausible production mechanisms, the preferred explana- 
tion of our observations is the incorporation of primordial O, into the 
cometary nucleus. 

Despite great efforts by remote sensing campaigns, information on 
primordial O, is still limited. Solid Oz has not yet been detected in 
interstellar ices, and upper limits for the O2/H2O ice ratios of <0.5 
and for O,/CO ratios of <1 are in agreement with our findings, but such 
high upper limits do not provide useful constraints’*’*. Gaseous O, has 
only been detected in two interstellar clouds so far*'*"’, and is generally 
known to have surprisingly low abundances*. Reports of very low upper 
limits for O, in a protostellar envelope suggest the material infalling to the 
accretion disk is very poor in molecular oxygen”®. This has been ascribed 
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Figure 3 | O,/H,O ratio over several months. There seems to be no 
systematic increase or decrease of the O/H,0 ratio. The variances happen on 
very short timescales and can be explained by the decrease of the O) ratio 

for high H2O abundances. It is not fully understood if the higher variability 
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of the O, ratio from October to the end of December 2014 can be attributed 
to orbital changes of the spacecraft or to physical changes of the cometary 
nucleus. 
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Figure 4 | DFMS spectra for some of the common products of radiolysis of 
water ice. a-c, Products are O,H (seen in a), '°O'8O and O3H; (seen in 

b) and O3 (not seen in c). These data were recorded on 20 October 2014 at 
around 01:00 utc. With the exception of O; (c), all the previously mentioned 


to the high volatility and reactivity of O2, in particular the rapid trans- 
formation of O and O, to H,O ice on cold grains”. 

However, in one of the two interstellar clouds where O, has been 
detected (the p Ophiuchi dense core), the chemically related species 
HO, and H,0, have been measured”! with gaseous abundance ratios 
of HO,/O, ~ H,0,/O, ~ 0.6 X 10°. Interestingly, the abundance 
ratios determined by DFMS for the coma of 67P are very close to 
these interstellar values: HO,/O, = (1.9+0.3) X 10° and H,0O,/ 
O, = (0.6 + 0.07) X 10? (see Fig. 4). If these gas-phase abundance 
ratios reflect those in the cometary ice, this would support the existence 
of primordial O,. The p Ophiuchi A core has been suggested to have 
experienced slightly higher temperatures of around 20-30 K over its 
lifetime (which is also typical of estimates for the comet-forming con- 
ditions in the outer early solar nebula), compared to ~10 K for most 
other dense interstellar clouds*’”. If higher temperatures are indeed 
needed to produce significant amounts of O», this would indicate that 
our Solar System was formed from an unusually warm molecular 
cloud, consistent with the low abundance of N> in 67P (ref. 23). 

One aspect of the present results that remains unexplained is the 
high value (a few per cent) of the O,/H,0O ratio in 67P. Models of gas- 
grain chemistry in molecular clouds predict O2/H0 ratios at least an 
order of magnitude lower’. They also over-predict ozone (O3): we 
found no evidence for the presence of ozone (see Fig. 4), with an upper 
limit of 1 X 10° relative to water. 

An alternative explanation for the presence of O2 is the incorpora- 
tion of gaseous O, into water ice in the protosolar nebula, and chemical 
models of protoplanetary disks do in fact show high abundances of 
gaseous O, in the comet forming zone™*. Rapid cooling from >100K 
to less than 30 K would then be needed to form amorphous water ice 
with trapped O, on dust grains. This could happen when young disks 
experience increased heating due to accretion bursts onto the star, 
followed by a rapid drop in temperature as soon as the burst is over. 
These O, rich grains then need to be accreted into larger bodies before 
further chemical modification occurs. 

Finally, we discuss radiolysis of icy grains before accretion. When 
produced by radiolysis in water ice, O2 can remain trapped in voids, 
while hydrogen can diffuse out’. This prevents the hydrogenation 
of O;, which is otherwise a dominant reaction for the destruction of 
molecular oxygen, and could lead to increased and stable levels of O2 in 
the solid ice”. Incorporation of such icy grains into the comet nucleus 
would explain the observed strong correlation with H,O, in contrast to 
N) which is trapped from the gas phase and shows a lower correlation 
with water. However, O3 (resulting from O, radiolysis) has been 
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species are measured and can clearly be identified in the mass spectra of DFMS. 
The grey area consists of the statistical error N//N and a 10% uncertainty 
from the individual pixel gains on the detector (here N is the number of 
molecules on the detector). 


reported to be trapped in Ganymede’s surface’®, and at such a concen- 
tration would just be detectable by DFMS for O, levels at a few per cent 
of H,0, but no O; could be detected. A further consequence would be 
that these icy grains have been incorporated into the comet mostly 
unaltered, a process that is much debated, but which has recently been 
proposed again” and that would also be in accordance with the mea- 
sured high D/H ratio in 67P (ref. 27). 

We note that our findings do not significantly affect our under- 
standing of the global distribution of elemental O in the interstellar 
medium, as O> in ice with an abundance of a few per cent relative to 
H,0 accounts only for a small fraction of the total oxygen inventory. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 


Received 29 June; accepted 18 August 2015. 


1. Bockelée-Morvan, D., Mumma, M. J. & Weaver, H. A. in Comets /I (eds Festou, M., 
Keller, U. H. & Weaver, H. A.) 391-423 (Univ. Arizona Press, 2004). 

2. Hall, D. T., Strobel, D. F., Feldman, P. D., McGarth, M. A. & Weaver, H. A. Detection of 
an oxygen atmosphere on Jupiter’s moon Europa. Nature 373, 677-679 (1995). 

3. Johnson, R. E. et al. Production, ionization and redistribution of O2 in Saturn’s ring 
atmosphere. Icarus 180, 393-402 (2006). 

4. Goldsmith, P. F. et al. Herschel measurements of molecular oxygen in Orion. 
Astrophys. J. 737, 96 (2011). 

5. Balsiger, H. et al. ROSINA — ROSETTA orbiter spectrometer for ion and neutral 

analysis. Space Sci. Rev. 128, 745-801 (2007). 

6. Hassig, M. et al. Time variability and heterogeneity in the coma of 67P/ 

Churyumov-Gerasimenko. Science 347, aaa0276 (2015). 

7. Luspay-Kuti, A. et al. Composition-dependent outgassing of comet 67P/ 

Churyumov-Gerasimenko from ROSINA/DFMS — implications for nucleus 

heterogeneity? Astron. Astrophys. http://dx.doi.org/10.1051/0004-6361/ 

201526205 (2015). 

8. De Sanctis, M. C. et a/. The diurnal cycle of water ice on comet 67P/Churyumov— 

Gerasimenko. Nature 525, 500-503 (2015). 

9. Capaccioni, F. et al. The organic-rich surface of comet 67P/Churyumov- 

Gerasimenko as seen by VIRTIS/Rosetta. Science 347, aaa0628 (2015). 

10. Brown, W. L. et al. Erosion and molecular formation in condensed gas films by 
electronic energy loss of fast ions. Nucl. Instrum. Methods 198, 1-8 (1982). 

11. Carlson, R. W. et al. Hydrogen peroxide on the surface of Europa. Science 283, 
2062-2064 (1999). 

12. Spencer, J. R., Calvin, W. M. & Person, M. J. Charge-coupled device spectra of the 
Galilean satellites: molecular oxygen on Ganymede. J. Geophys. Res. 100, 
19049-19056 (1995). 

13. Spencer, J. R. & Calvin, W. M. Condensed Oz on Europa and Callisto. Astron. J. 124, 
3400-3403 (2002). 

14. Vandenbussche, B. eta/. Constraints on the abundance of solid O2 in dense clouds 
from ISO-SWS and ground-based observations. Astron. Astrophys. 346, L57-L60 
(1999). 

15. Pontoppidan, K. et a/. A 3-5 um VLT spectroscopic survey of embedded young low 
mass stars |. Structure of the CO ice. Astron. Astrophys. 408, 981-1007 (2003). 


©2015 Macmillan Publishers Limited. All rights reserved 


16. Liseau, R. et al. Multi-line detection of O2 toward p Ophiuchi A. Astron. Astrophys. 
541, A73 (2012). 

17. Larsson, B. et al. Molecular oxygen in the Ophiuchi cloud. Astron. Astrophys. 466, 
999-1003 (2007). 

18. Yildiz, U. A. et al. Deep observations of O2 toward a low-mass protostar with 
Herschel-HIFI. Astron. Astrophys. 558, A58 (2013). 

19. Taquet, V., Ceccarelli, C. & Kahane, C. Multilayer modeling of porous grain surface 
chemistry. |. The GRAINOBLE model. Astron. Astrophys. 538, A42 (2012). 

20. Bergman, P. et al. Detection of interstellar hydrogen peroxide. Astron. Astrophys. 
531, L8 (2011). 

21. Parise, B., Bergman, P. & Du, F. Detection of the hydroperoxyl radical HO» toward p 
Ophiuchi A. Additional constraints on the water chemical network. Astron. 
Astrophys. 541, L11 (2012). 

22. Du, F., Parise, B. & Bergman, P. Production of interstellar hydrogen peroxide 
(H20z) on the surface of dust grains. Astron. Astrophys. 538, A91 (2012). 

23. Rubin, M. et a/. Molecular nitrogen in comet 67P/Churyumov-Gerasimenko 
indicates a low formation temperature. Science 348, 232-235 (2015). 

24. Walsh, C., Nomura, H. & van Dishoeck, E. The molecular composition of the planet- 
forming regions of protoplanetary disks across the luminosity range. Astron. 
Astrophys. 582, A88 (2015). 

25. Johnson, R. E. & Jesser, W. A. 02/03 microatmospheres in the surface of 
Ganymede. Astrophys. J. 480, L79-L82 (1997). 

26. Cleeves, L.|.eta/. The ancient heritage of water ice in the solar system. Science 345, 
1590-1593 (2014). 

27. Altwegg, K. et al. 67P/Churyumov-Gerasimenko, a Jupiter family comet with a 
high D/H ratio. Science 347, http://dx.doi.org/10.1126/science.1261952 
(2015). 


Acknowledgements Work at the University of Michigan was funded by NASA contract 
JPL-1266313. Work at the University of Bern was funded by the State of Bern, the Swiss 


LETTER 


National Science Foundation and the European Space Agency PRODEX Program. Work 
at Max-Planck-Institut ftir Sonnensystemforschung was funded by the Max-Planck 
Society and BMWI contract 50QP1302. Work at Southwest Research Institute was 
supported by subcontract 1496541 from the Jet Propulsion Laboratory. Work at 
BIRA-IASB was supported by the Belgian Science Policy Office via PRODEX/ROSINA 
PEA 90020. This work was carried out thanks to the support of the A*MIDEX project (no. 
ANR-11-IDEX-0001-02) funded by the ‘Investissements d’Avenir’ French Government 
programme, managed by the French National Research Agency (ANR). This work was 
supported by CNES grants at IRAP, LATMOS, LPC2E, UTINAM, CRPG, and by the 
European Research Council (grant no. 267255 to B.M.). A.B.-N. thanks the Ministry of 
Science and the Israel Space agency. Work by J.H.W. at Southwest Research Institute 
was funded by NASA JPL subcontract NAS703001TONMO7 10889. E.F.v.D. and C.W. 
are supported by A-ERC grant 291141 CHEMPLAN and an NWO Veni award. We 
acknowledge here the work of the whole ESA Rosetta team. 


Author Contributions A.B. performed data reduction, analysis and wrote the paper; 
KA. initialized and edited the paper and contributed to data interpretation; C.B., U.C., 
M.C,, T.LG., K.C.H., S.G., M.H., AJ., R.M., LLR., M.R., C.-Y.T. and T.S. contributed to data 
analysis and interpretation. A.B.-N. and O.M. contributed to data interpretation relevant 
to processes in ices. E.F.v.D and C.W. contributed to data interpretation and writing of 
sections concerning interstellar oxygen. H.B., J.-J.B., P.B., J.D.K., B.F., S.A.F., A.K., U.M., 
B.M., T.O., H.R., J.H.W. and P.W. contributed to experiment design, calibration and data 
interpretation. All authors discussed the results, and commented on and revised the 
manuscript. 


Author Information All ROSINA-DFMS data will be released to the PSA archive of ESA 
and to the PDS archive of NASA. Reprints and permissions information is available at 
www.nature.com/reprints. The authors declare no competing financial interests. 
Readers are welcome to comment on the online version of the paper. Correspondence 
and requests for materials should be addressed to A.B. (abieler@umich.edu). 


29 OCTOBER 2015 | VOL 526 | NATURE | 681 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


METHODS 


Data reduction. The integration time for all evaluated spectra is 20s. A single 
mass spectrum consists of the signals of 512 individual pixels, which are arranged 
along the dispersive axis of the mass spectrometer. We take the gain degradation of 
each individual pixel over time into account by determining its gain with a cal- 
ibration sequence dedicated to this task every few weeks. 

For every recorded mass spectrum, a third-degree polynomial is fitted to the 
baseline of the mass spectrum and subsequently subtracted from the signal. To 
avoid contamination in the baseline fit from the measured peaks, the centre of the 
mass spectrum (with the peaks) is ignored for those fitting purposes. Peaks are 
fitted with a Gaussian-shape curve. This introduces a systematic underestimation 
of the absolute magnitude of the signal, as at the 1% level and below the peaks are 
broader than Gaussian, but cancels out for the reported ratios of O2/HO as both 
species are underestimated similarly. In a post-processing procedure, outliers are 
removed. An outlier occurs if one of the following criteria is met: (a) the signal 
amplitude changes by more than two orders of magnitudes between two mea- 
surements; (b) the centre of the fitted peak is not within a given window around 
the predicted position; (c) the width of the fitted peak curve is wider than a set 
limit. 

For the CO measurements, the fraction contributed to the signal by CO frag- 
ments from CO) is subtracted. As DFMS measures one mass line at a time in high 
resolution mode, there are no simultaneous measurements of two species that are 
separated by more than 1 Da/e. To calculate ratios, the two measurements have to 
occur within 20 min; if no pair of measurements can be found within 20 min, the 
data point is ignored. In our error analysis we include uncertainties for the indi- 
vidual gain levels of the detector, the species branching ratios and their relative 
sensitivities as systematic errors. Individual pixel gains are treated as statistical 
error, as the peak position on the detector is not constant. A careful analysis 
confirmed that the O2/H,O ratio is independent of the peak centre positions. 
Sample size. No statistical methods were used to predetermine sample size. 
Spacecraft outgassing background. To clearly identify the measured O, as comet- 
ary in origin, all non-cometary sources of O must be considered and excluded. 
The Rosetta spacecraft produces a neutral gas cloud of its own, mainly due to 
diffusion of volatiles out of spacecraft material and desorption of re-deposited 
volatiles from the spacecraft. For example, by changing the spacecraft attitude, 
different spacecraft elements are illuminated by the Sun, which then warm up and 
release condensed gas. The orange line in Fig. 1 shows the low-level signals from 
this spacecraft contamination for O2, S and CH30OH, referred to as “background”. 
Those measurements were taken several days before the encounter with 67P. It is 
not possible to distinguish this background signal from any potential cometary 
signature with DFMS, but it has been well characterized before the arrival at 67P 
and is usually orders of magnitude lower than the measured O, signals”. To keep 
the background influence as low as possible, we only considered mass spectra 
where both the O, and H,O abundances are at least 5 times larger than the 
corresponding spacecraft contamination. Another potential source of Oy is the 
oxidizer, N2O,4, used by the Rosetta spacecraft during thruster firings. 
Measurements taken shortly after a large thruster firing manoeuvre from June 
2014 (still before arrival at the comet) show minor contaminations around 
32 Da/e, but not directly affecting the O2 peak (see green curve in Fig. 1). 
Although contamination from thruster firings is small, DFMS measurements 
are usually performed hours after thruster firings, in order to minimize influence 
thereof. Finally we can exclude the production of O, inside the instrument 
through a careful review of all oxygen-bearing molecules up to 150 Da/e, which 
could potentially fragment into O, in the DFMS electron impact ion source. Many 
minor species contain O, but these are too low in abundance to account for the 
large amount of O, detected. The remaining possibility is CO2, which is very 
abundant in the coma of 67P (ref. 6). However, owing to its molecular structure it 
only fragments into CO and O, not into Op (ref. 29). Finally, we exclude the 
production of O2 from HO in the instrument. For 81 mass spectra taken from 
May to the end of June 2014 (before the encounter with 67P) we determine an O2 
abundance of (0.18 + 0.07)% relative to H2O, which is a factor of 20 lower than 
the cometary values. 

Coma chemistry. The production of O from atomic oxygen in the neutral gas 
coma of 67P is possible through the reaction 


O0+0OH—0O,+H 


Owing to the tenuous nature of the coma, only very few collisions are expected 
within the first 100 km above the nucleus surface*®. With increasing activity and 
therefore denser coma one would expect a higher O2/H20 ratio as both reactants 
(atomic oxygen and OH) are related to H2O as well as the collision frequency. This 
is clearly not observed. Production of a significant amount of O2 from coma 
chemistry hence seems unfeasible. 


Correlation with H,O. The measured O, signal shows a very strong dependence 
on radial distance (r) from the comet. It increases by roughly one order of 
magnitude when the radial distance from the comet decreases from 30km to 
10km. This is in agreement with a predicted 1/r? dependence of the number 
density profile of a non-reactive species. Examining the data further, we observe 
a strong correlation between HO and Oy (see Fig. 2, Pearson correlation coef- 
ficient R = 0.88) for data from September 2014 to March 2015. This correlation 
indicates that O2 and HO are both of a similar cometary origin. In contrast, there 
is no correlation between O, and HO for measurements taken before the arrival at 
the comet (R = —0.01). The observed temporal variations in the O2/H20 ratio are 
largely due to a nonlinear correlation between H,O and O, for high water densi- 
ties, where the O, ratio drops with increasing HO abundance. The correlation 
similarly supports the ruling out of CO asa source of the Oz, as previous studies” 
and Fig. 2 show a lack of correlation between the abundance of H,O and other 
species like CO, CO, and Np. 

Radiolysis. The production of O, from water ice by radiolysis is the result of 
several reactions, where initially H, O and OH are produced, followed by sub- 
sequent rearrangement to form Hz, HO, and H,O, and ultimately O, (ref. 2). Ina 
systematic study of the irradiation of pure cubic crystalline water ice at 10 K, it was 
estimated that comets in the Oort cloud would reach an equilibrium concentration 
of about 0.6% of HO; after ~ 10° years, and that the concentration of O2 would be 
significantly lower*'. As stated by the authors, the relative proportion of O, and 
H,0Q) in actual astrophysical ices can be different as the presence of H2O. ices may 
enhance the production of O, and vice versa. 

Cosmic rays consist of energetic particles (mostly H*) that can penetrate inside 
the cometary nucleus. Their penetration depth depends on their energy. The more 
energetic a cosmic ray, the deeper it deposits its energy into the cometary nucleus. 
Only the most energetic components, the Galactic cosmic rays (GCRs), can pen- 
etrate deeply into the cometary body. Today, the GCR flux peaks at energies 
around a few hundreds of MeV. At energies higher than ~1 GeV, the GCR flux 
drops exponentially, the flux at 10 GeV being approximately 100 times lower than 
the flux at 1 GeV. GCRs with energies near the GCR maximum of flux typically 
deposit their energy in the nucleus at depths of the order of metres to tens of 
metres*. There is some uncertainty on the evolution of the cosmic ray flux over the 
history of the Solar System related to the occurrence of supernovae which can 
drastically increase the cosmic ray flux at high energies (up to ~PeV) during 
relatively short time periods (of the order of 10° years) or to the orbital motion 
of the Sun and of the Galaxy. There are also uncertainties on the cross-sections, the 
relative proportion of high-Z cosmic rays, and the role of porosity and defects in 
the cometary ice™’. 

During 67P’s stay in the Kuiper belt (~4.5 billion years) GCRs may have pro- 
duced a significant amount of O. This O, is produced where the GCR energy is 
deposited, that is, in the first tens of metres below the comet surface. Once come- 
tary objects enter the inner Solar System they lose (owing to their activity and 
depending on their perihelion distance) material from the surface in the range of 
several metres per orbit around the Sun. Therefore, as most products built up 
during the stay in the Kuiper belt reside in the outer few metres, O2 produced by 
radiolysis should be released quickly on the first solar passages. 67P’s perihelion 
distance has been within the orbit of Jupiter for the past 250 years, possibly even for 
more than 5,000 years, and since a close encounter with Jupiter in 1959, the 
perihelion distance of 67P has been about 1.3 Au with an orbital period of 6.4 years 
(ref. 34). Accumulated over the last perihelion passages we can assume that 67P 
has lost hundreds of metres from its surface. 

However, O2 molecules may have diffused into the comet and penetrated into 
deeper regions of the cometary nucleus. According to the estimated erosion of the 
comet, this requires significant diffusion of the O2 molecules. Such efficient dif- 
fusion would dilute the produced Oz, resulting in a lower O/H0O ratio hardly 
compatible with the elevated amount of O, observed by DFMS. 

Recent radiolysis. The flux of the low energy component such as solar energetic 
particles (SEP) is several orders of magnitude higher than the flux of GCRs. The 
SEP flux peaks at energies around a few tens of MeV and only penetrates the first 
centimetres of the comet*. Considering the comet erosion rate, the production of 
O, by low energy cosmic rays would have to be very fast to account for the 
observed amount of O5, requiring unrealistically high O2 production rates. 
Consequently, it appears unlikely that most of the O, in 67P has been built up 
over the past few years. 

O, production through surface sputtering. It has been shown that sputtering of 
refractory materials from the cometary surface due to the solar wind is occurring at 
67P (ref. 35) and a clear difference between the southern and northern latitudes in 
the measured abundances of the sputtered species was demonstrated. This appar- 
ent spatial difference is explained via the asymmetry of the neutral coma, with 
higher number densities in the northern hemisphere that is preferentially exposed 
to the Sun for the time span under study. The solar wind, which is responsible for 
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the sputtering, is therefore attenuated more efficiently by these denser parts of the 
coma and thus has limited or no access to the surface. We find that the O2/H2,O 
ratio is independent of latitude and relatively constant over a period of 7 months. 
Furthermore, the major part of the top surface of 67P accessible by the solar wind 
does not contain any water ice’. This suggests that sputtering cannot be the main 
source of the detected molecular oxygen. Moreover, the sputter yields by solar 
wind ions are orders of magnitude too low to explain the observed amount of O>. 
Code availability. The data reduction software was written in the Julia and Python 
programming languages and is available upon request from A.B. 
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Loophole-free Bell inequality violation using 
electron spins separated by 1.3 kilometres 
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R. N. Schouten’, C. Abellan?, W. Amaya’, V. Pruneri®*, M. W. Mitchell**, M. Markham’, D. J. Twitchen®, D. Elkouss’, 


S. Wehner!, T. H. Taminiau>? & R. Hanson’? 


More than 50 years ago’, John Bell proved that no theory of nature 
that obeys locality and realism’ can reproduce all the predictions of 
quantum theory: in any local-realist theory, the correlations 
between outcomes of measurements on distant particles satisfy 
an inequality that can be violated if the particles are entangled. 
Numerous Bell inequality tests have been reported*'’; however, 
all experiments reported so far required additional assump- 
tions to obtain a contradiction with local realism, resulting in 
‘loopholes’”*"'*. Here we report a Bell experiment that is free of 
any such additional assumption and thus directly tests the principles 
underlying Bell’s inequality. We use an event-ready scheme’ that 
enables the generation of robust entanglement between distant 
electron spins (estimated state fidelity of 0.92 + 0.03). Efficient 
spin read-out avoids the fair-sampling assumption (detection 
loophole’*’’), while the use of fast random-basis selection and spin 
read-out combined with a spatial separation of 1.3 kilometres 
ensure the required locality conditions'*. We performed 245 trials 
that tested the CHSH-Bell inequality? S=<2 and found 
S= 2.42 + 0.20 (where S quantifies the correlation between mea- 
surement outcomes). A null-hypothesis test yields a probability 
of at most P = 0.039 that a local-realist model for space-like sepa- 
rated sites could produce data with a violation at least as large as 
we observe, even when allowing for memory’®”’ in the devices. 
Our data hence imply statistically significant rejection of the 
local-realist null hypothesis. This conclusion may be further con- 
solidated in future experiments; for instance, reaching a value of 
P= 0.001 would require approximately 700 trials for an observed 
S= 2.4. With improvements, our experiment could be used for 
testing less-conventional theories, and for implementing device- 
independent quantum-secure communication” and randomness 
certification”*™*. 

We consider a Bell test in the form proposed by Clauser, Horne, 
Shimony and Holt (CHSH)”° (Fig. 1a). The test involves two boxes 
labelled A and B. Each box accepts a binary input (0 or 1) and subse- 
quently delivers a binary output (+1 or —1). In each trial of the Bell 
test, a random input bit is generated on each side and input to the 
respective box. The random input bit triggers the box to produce an 
output value that is recorded. The test concerns correlations between 
the output values (labelled x and y for boxes A and B, respectively) and 
the input bits (labelled a and b for A and B, respectively) generated 
within the same trial. 

The discovery made by Bell is that in any theory of physics that is 
both local (physical influences do not propagate faster than light) and 
realistic (physical properties are defined before, and independent of, 
observation) these correlations are bounded more strongly than they 
are in quantum theory. In particular, if the input bits can be considered 
free random variables (condition of ‘free will’) and the boxes are 


sufficiently separated such that locality prevents communication 
between the boxes during a trial, then the following inequality holds 
under local realism: 


S= (XY) 0.0) + (Yay + (*-¥) 0) = (*Y)aay <2 (1) 


where (x - y)(a,b) denotes the expectation value of the product of x and y 
for input bits a and b. (A mathematical formulation of the concepts 
underlying Bell’s inequality is found in, for example, ref. 25.) 

Quantum theory predicts that the Bell inequality can be significantly 
violated in the following setting. We add one particle, for example an 
electron, to each box. The spin degree of freedom of the electron forms 
a two-level system with eigenstates ||) and ||). For each trial, the two 
spins are prepared into the entangled state |W~ ) =(|t 1) —|J))/v2. 
The spin in box A is then measured along direction Z (for input bit 
a=0) or X (for a=1) and the spin in box B is measured along 
(—Z+X)/V2 (for b = 0) or (—Z—X)/V2 (for b = 1). If the mea- 
surement outcomes are used as outputs of the boxes, then quantum 
theory predicts a value of S= 2/2, which shows that the combination 
of locality and realism is fundamentally incompatible with the predic- 
tions of quantum mechanics. 

Bell’s inequality provides a powerful recipe for probing fundamental 
properties of nature: all local-realist theories that specify where and 
when the free random input bits and the output values are generated 
can be experimentally tested against it. 

Violating Bell’s inequality with entangled particles poses two main 
challenges: excluding any possible communication between the boxes 
(locality loophole’’) and guaranteeing efficient measurements (detec- 
tion loophole’*"’). First, if communication is possible, a box can in 
principle respond using knowledge of both input settings, rendering 
the Bell inequality invalid. The locality conditions thus require boxes A 
and B and their respective free-input-bit generations to be separated in 
such a way that signals travelling at the speed of light (the maximum 
allowed under special relativity) cannot communicate the local input 
setting of box A to box B, before the output value of box B has been 
recorded, and vice versa. Second, disregarding trials in which a box 
does not produce an output bit (that is, assuming fair sampling) would 
allow the boxes to select trials on the basis of the input setting. The fair 
sampling assumption thus opens a detection loophole’*”’: the selected 
subset of trials may show a violation even though the set of all trials 
may not. 

The locality loophole has been addressed with pairs of photons 
separated over a large enough distance, in combination with fast set- 
tings changes* and later with settings determined by fast random 
number generators”°. However, these experiments left open the detec- 
tion loophole, owing to imperfect detectors and inevitable photon loss 
during the spatial distribution of entanglement. The detection loop- 
hole has been closed in different experiments®*'°”, but these did not 
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Figure 1 | Bell-test schematic and experimental realization. a, Bell-test set- 
up: two boxes, A and B, accept binary inputs (a, b) and produce binary outputs 
(x, y). In an event-ready scenario, an additional box C gives a binary output 
signalling that A and B were successfully prepared. b, Experimental realization. 
The set-up consists of three separate laboratories, A, B and C. The boxes at 
locations A and B each contain a single NV centre in diamond. A quantum 
random-number generator (RNG) is used to provide the input. The NV 
electronic spin is read out in a basis that depends on the input bit, and the 
resultant signal provides the output. A box at location C records the arrival 
of single photons that were previously emitted by, and entangled with, the spins 
at A and B. c, Experimental set-up at A and B. The NV centre is located in a 
low-temperature confocal microscope (Obj.). Depending on the output of the 
RNG, a fast switch (Sw.) transmits one of two different microwave pulses 

(Po and P;) into a gold line deposited on the diamond surface (inset, scanning 


close the locality loophole. So far, no experiment has closed all the 
loopholes simultaneously. 

A Bell test that closes all experimental loopholes at the same time— 
commonly referred to as a loophole-free Bell test’*°—is of founda- 
tional importance to the understanding of nature. In addition, a loop- 
hole-free Bell test is a critical component for device-independent 
quantum security protocols’ and randomness certification®**. In 
such adversarial scenarios, all loopholes are ideally closed because they 
allow for security breaches in the system”’. 

One approach for realizing a loophole-free set-up was proposed by 
Bell himself"’. The key idea is to record an additional signal (dashed 
box in Fig. la) to indicate whether the required entangled state was 
successfully shared between A and B, that is, whether the boxes 
were ready to be used for a trial of the Bell test. By conditioning the 
validity of a Bell-test trial on this event-ready signal, failed entangle- 
ment distribution events are excluded upfront from being used in the 
Bell test. 


Le i 

WS fo See 

5 be 
Me 


electron microscope image). Pulsed red and yellow lasers are used to 
resonantly excite the optical transitions of the NV centre. The emission (dashed 
arrows) is spectrally separated into an off-resonant part (phonon side band, 
PSB) and a resonant part (zero-phonon line, ZPL), using a dichroic mirror 
(DM). The PSB emission is detected with a single-photon counter (APD). The 
ZPL emission is transmitted through a beam-sampler (BS, reflection =4%) 
and wave plates (//2 and //4), and sent to location C through a single-mode 
fibre. d, Set-up at location C. The fibres from A and B are connected to a fibre- 
based beam splitter (FBS) after passing a fibre-based polarizer (POL). Photons 
in the output ports are detected and recorded. e, Aerial photograph of the 
campus of Delft University of Technology indicating the distances between 
locations A, B and C. The red dotted line marks the path of the fibre connection. 
Aerial photograph by Slagboom en Peeters Luchtfotografie BV. 


We implemented an event-ready Bell set-up’*”” with boxes that use 


the electronic spin associated with a single nitrogen-vacancy (NV) 
defect centre in a diamond chip (Fig. 1b). The diamond chips are 
mounted in closed-cycle cryostats (T = 4 K) located in distant laborat- 
ories named A and B (Fig. 1c). We control the electronic spin state of 
each NV centre with microwave pulses applied to on-chip striplines 
(Fig. 1c, inset). The spins are initialized through optical pumping and 
read out along the Z axis via spin-dependent fluorescence’’. The read- 
out relies on resonant excitation of a spin-selective cycling transition 
(12-ns lifetime), which causes the NV centre to emit many photons when 
it is in the bright m, = 0 spin state, while it remains dark when it is in 
either of the m, = +1 states. We assign the value +1 (m, = 0) to the 
output if we record at least one photo-detector count during the read-out 
window, and the value —1 (m, = +1) otherwise. Read-out in a rotated 
basis is achieved by first rotating the spin, followed by read-out along Z. 

We generate entanglement between the two distant spins by entan- 
glement swapping” in the Barrett-Kok scheme” using a third loca- 
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tion C (roughly midway between A and B; see Fig. le). First we 
entangle each spin with the emission time of a single photon (time- 
bin encoding). The two photons are then sent to location C, where they 
are overlapped on a beam-splitter and subsequently detected. If the 
photons are indistinguishable in all degrees of freedom, then the obser- 
vation of one early and one late photon in different output ports 
projects the spins at A and B into the maximally entangled state 
lv) = (IN) —I1))/v2, where m,=0 = |1) and m,=—1 = ||). 
These detections herald the successful preparation and play the role of 
the event-ready signal in Bell’s proposed set-up. As can be seen in the 
space-time diagram in Fig. 2a, we ensure that this event-ready signal is 
space-like separated from the random input-bit generation at locations 
A and B. 

The separation of the spins by 1,280 m defines a 4.27-[1s time win- 
dow during which the local events at A and B are space-like separated 
from each other (see the space-time diagram in Fig. 2b). To comply 
with the locality conditions of the Bell test, the choice of measurement 
bases and the measurement of the spins should be performed within 
this time window. For the basis choice we use fast random-number 
generators with real-time randomness extraction’’. We reserve 160 ns 
for the random basis choice, during which time one extremely random 
bit is generated from 32 partially random raw bits (Supplementary 
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Figure 2 | Space-time analysis of the experiment. a, Space-time diagram 
of a single repetition of the entanglement generation. The x axis denotes the 
distance along the lines AC and CB. After spin initialization, spin-photon 
entanglement is generated, such that the two photons from A and B arrive 
simultaneously at C where the detection time of the photons is recorded. 
Successful preparation of the spins is signalled (bell symbol) by a specific 
coincidence detection pattern. Independent of the event-ready signal, the set- 
ups at locations A and B choose a random basis (RNG symbol), rotate the 
spin accordingly and start the optical spin read-out (measurement symbol). 
Vertical bars indicate durations. The event-ready signal lies outside the 
future light cone (coloured regions) of the random basis choices of A and B. 
b, Space-time diagram of the Bell test. The x axis denotes the distance along the 
line AB. The read-out on each side is completed before any light-speed signal 
can communicate the basis choice from the other side. The uncertainty in 
the depicted event times and locations is much smaller than the symbol size. 
c, Single-shot spin read-out fidelity at location A as a function of read-out 
duration (set by the latest time that detection events are taken into account). 
Blue (orange) line, fidelity of outcome +1 (—1) when the spin is prepared in 
m; = 0 (m, = £1); green line, average read-out fidelity; dotted line, read-out 
duration used (3.7 [ls). The inset shows the relevant ground and excited-state 
levels (not to scale). 


684 | NATURE | VOL 526 | 29 OCTOBER 2015 


Information). The random bit sets the state of a fast microwave switch 
that selects one out of two preprogrammed microwave pulses imple- 
menting the two possible read-out bases (Fig. 1c). Adding the dura- 
tions of each of the above steps yields a maximum time from the start 
of the basis choice to the start of the read-out of 480 ns. We choose the 
read-out duration to be 3.7 1s, which leaves 90 ns to cover any uncer- 
tainty in the distance between the laboratories and the synchronization 
of the set-up (estimated total error is at most 16 ns; see Supplementary 
Information). For this read-out duration, the combined initialization 
and single-shot read-out fidelity of sample A is (97.1 + 0.2)% (Fig. 2c); 
sample B achieves (96.3 + 0.3)%. In summary, the use of the event- 
ready scheme enables us to comply with the strict locality conditions of 
the Bell set-up by using photons to distribute entanglement, while 
simultaneously using the single-shot nature of the spin read-out to 
close the detection loophole. 

Before running the Bell test we first characterized the set-up and the 
preparation of the spin-spin entangled state. Figure 3a displays cor- 
relation measurements on the entangled spin-photon states to be used 
for the entanglement swapping. For both locations A and B we observe 
near-unity correlations between spin state and photon time bin when 
spin read-out errors are accounted for. We then estimate the degree of 
indistinguishability of the single photons emitted at locations A and B 
in a Hong-Ou-Mandel”' two-photon-interference experiment at loca- 
tion C, that is, after the photons have travelled through a combined 
length of 1.7 km of fibre. Using the observed two-photon interference 
contrast of 0.90+0.06 and the spin-photon correlation data, we 
estimate that the fidelity to the ideal state yy of the spin-spin entang- 
led states generated in our set-up is 0.92 + 0.03 (Supplementary 
Information). Combined with measured read-out fidelities, the gen- 
erated entangled state is thus expected to violate the CHSH-Bell 
inequality with S = 2.30 + 0.07. 

As a final characterization we ran the full Bell sequence including 
random number generation and fast read-out, but with co-linear mea- 
surement bases (ZZ and XX) such that spin-spin correlations could be 
observed with optimal contrast. To test the fast basis selection and 
rotation, the Z (X) basis measurements are randomly performed along 
the +Z (+X) and —Z (—X) axis. The observed correlations, shown in 
Fig. 3c (orange bars), are consistent with the estimated quantum state 
and the independently measured read-out fidelities (dotted bars), 
which confirms that the set-up is performing as expected and that 
the desired entangled state is generated. 

We find a success probability per entanglement generation attempt 
of about 6.4 X 107°, which yields slightly more than one event-ready 
signal per hour. Compared to our previous heralded entanglement 
experiments over 3m (ref. 29), this probability is reduced, mainly 
owing to additional photon loss (8 dBkm™') in the 1.7-km optical 
fibre. To ensure the required long-term operation, we exploit active 
stabilization on different relevant timescales via automated feedback 
loops (Supplementary Information). We note that the distance 
between the entangled electrons is nearly two orders of magnitude 
larger than it was in any previous experiment”"°””** with entangled 
matter systems. 

Using the results of the characterization measurements we deter- 
mine the optimal read-out bases for our Bell test. A numerical optim- 
ization yields the following angles for the read-out bases with respect to 
Z: 0 (for a = 0), + 2/2 (for a = 1), —3n/4 — ¢ (for b= 0) and 37/4 + € 
(for b= 1), with ¢ = 0.026. Adding the small angle ¢ is beneficial 
because of the stronger correlations in ZZ compared to XX. 
Furthermore, we use the characterization data to determine the time 
window for valid photon-detection events at location C to optimally 
reject reflected laser light and detector dark counts. We choose this 
window conservatively to optimize the entangled-state fidelity at the 
cost of a reduced data rate. These settings are then fixed and used 
throughout the actual Bell test. As a final optimization we replaced 
the photo-detectors at location C with the best set we had available. 
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Figure 3 | Characterization of the set-up and the entangled state. a, The 
probability to obtain spin state |1) at location A (left panel) or B (right panel) 
when a single photon is detected in the early or late time bin at location C. In the 
left (right) panel, only emission from A (B) was recorded. Dotted bars are 
corrected for finite spin read-out fidelity and yield remaining errors of 

1.4% + 0.2% (1.6% + 0.2%) and 0.8% + 0.4% (0.7% + 0.4%) for early and late 
detection events, respectively, from set-up A (B). These errors include imperfect 
rejection of the excitation laser pulses, detector dark counts, microwave-pulse 
errors and off-resonant excitation of the NV. b, Two-photon quantum 
interference signal, with dt the time between the two photo-detection events. 
When the NV centres at A and B emit indistinguishable photons, coincident 
detections of two photons, one in each output arm of the beam-splitter at C, are 
expected to vanish. The observed contrast between the cases of indistinguishable 
(orange) and distinguishable (grey) photons (3 versus 28 events in the central 
peak) yields a visibility of (90 + 6)% (Supplementary Information). 

c, Characterization of the Bell set-up using (anti-)parallel read-out angles. The 
spins at A (left arrows on the x axis) and B (right arrows on the x axis) are read 
out along the +Z axis (left panels) or the +X axis (right panels). The numbers in 
brackets are the raw number of events. The dotted lines represent the expected 
correlations on the basis of the characterization measurements presented in 

a and b (Supplementary Information). The data yield a strict lower bound” on 
the state fidelity to |) of 0.83 + 0.05. Error bars are 1 s.d. 
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Figure 4 | Loophole-free Bell inequality violation. a, Summary of the data 
and the CHSH correlations. The read-out bases corresponding to the input 
values are indicated by the green (for A) and blue (for B) arrows. Dotted 
lines indicate the expected correlation on the basis of the spin read-out 
fidelities and the characterization measurements presented in Fig. 3 
(Supplementary Information). Numbers above the bars represent the number 
of correlated and anti-correlated outcomes, respectively. Error bars shown are 


/ (1 —(x-y) iat) / Na,b)> With nq) the number of events with inputs (a, b). 


b, Statistical analysis for n = 245 trials. For the null-hypothesis test performed 
(Supplementary Information), the dependence of the P value on the J value is 
shown (complete analysis, red). Here [= 8(é _ 1), with k the number of times 
(-1)@ x. y = 1. (For equal n(,,), J = S with S defined in equation (1).) A 
small P value indicates strong evidence against the null hypothesis. We find 

k = 196, which results in a rejection of the null hypothesis with a P = 0.039. For 
comparison, we also plot the P value for an analysis (conventional analysis, 
orange) assuming independent and identically distributed (i.i.d.) trials, 
Gaussian statistics, no memory and perfect random-number generators. 


We ran 245 trials of the Bell test during a total measurement time of 
220 h over a period of 18 days. Figure 4a summarizes the observed data, 
from which we find S = 2.42, in violation of the CHSH-Bell inequality 
S=2. We quantify the significance of this violation for two different 
scenarios (see Fig. 4b). First, similar to previous work*”, we analyse the 
data under the assumptions that the Bell trials are independent of each 
other, that the recorded random input bits have zero predictability and 
that the outcomes follow a Gaussian distribution. This analysis (which 
we term ‘conventional’) yields a standard deviation of 0.20 on S. In this 
case, the null hypothesis that a local-realist model for space-like sepa- 
rated sites describes our experiment is rejected with a P value of 0.019 
(see Supplementary Information). 

The assumptions made in the conventional analysis are not justified 
in a typical Bell experiment. For instance, although the locality condi- 
tions outlined earlier are designed to ensure independent operation 
during a single trial, the boxes can in principle have access to the entire 
history including results from all previous trials and adjust their output 
to it’®*', Our second analysis (which we term ‘complete’) allows for 
arbitrary memory, takes the partial predictability of the random input 
bits into account and also makes no assumption about the probability 
distributions underlying the data (see Supplementary Information). In 
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this case, the null hypothesis that an arbitrary local-realist model of 
space-like separated sites governs our experiment is rejected with a P 
value of 0.039 (Fig. 4b). This P value might be further tightened in 
future experiments. 

Our experiment realizes the first Bell test that simultaneously 
addresses both the detection loophole and the locality loophole. 
Being free of the experimental loopholes, the set-up tests local-realist 
theories of nature without introducing extra assumptions such as fair 
sampling, a limit on (sub-)luminal communication or the absence of 
memory in the set-up. Our observation of a statistically significant 
loophole-free Bell inequality violation thus indicates rejection of all 
local-realist theories that accept that the number generators produce a 
free random bit in a timely manner and that the outputs are final once 
recorded in the electronics. 

Strictly speaking, no Bell experiment can exclude all conceivable 
local-realist theories, because it is fundamentally impossible to prove 
when and where free random input bits and output values came into 
existence’’. Even so, our loophole-free Bell test opens the possibility to 
progressively bound such less-conventional theories: by increasing the 
distance between A and B (for example, to test theories with increased 
speed of physical influence); by using different random input bit gen- 
erators (to test theories with specific free-will agents, for example, 
humans); or by repositioning the random input bit generators (to 
test theories where the inputs are already determined earlier, some- 
times referred to as ‘freedom-of-choice”). In fact, our experiment 
already enables tests of all models that predict that the random inputs 
are determined a maximum of 690 ns before we record them 
(Supplementary Information). 

Combining the presented event-ready scheme with higher entang- 
ling rates (for example, through the use of optical cavities) provides 
prospects for the implementation of device-independent quantum key 
distribution” and randomness certification”***. In combination with 
quantum repeaters, this might enable the realization of large-scale 
quantum networks that are secured through the very same counter- 
intuitive concepts that inspired one of the most fundamental scientific 
debates for 80 years'*”. 
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Reactions based on transition metals have found wide use in 
organic synthesis, in particular for the functionalization of small 
molecules’”. However, there are very few reports of using trans- 
ition-metal-based reactions to modify complex biomolecules**, 
which is due to the need for stringent reaction conditions (for 
example, aqueous media, low temperature and mild pH) and the 
existence of multiple reactive functional groups found in biomo- 
lecules. Here we report that palladium(m) complexes can be used 
for efficient and highly selective cysteine conjugation (bioconjuga- 
tion) reactions that are rapid and robust under a range of bio- 
compatible reaction conditions. The straightforward synthesis of 
the palladium reagents from diverse and easily accessible aryl 
halide and trifluoromethanesulfonate precursors makes the 
method highly practical, providing access to a large structural 
space for protein modification. The resulting aryl bioconjugates 
are stable towards acids, bases, oxidants and external thiol nucleo- 
philes. The broad utility of the bioconjugation platform was fur- 
ther corroborated by the synthesis of new classes of stapled 
peptides and antibody-drug conjugates. These palladium com- 
plexes show potential as benchtop reagents for diverse bioconjuga- 
tion applications. 

Post-translational modifications greatly expand the function of pro- 
teins’. Chemists aim to mimic the success of such natural transforma- 
tions through the development of chemo- and regioselective reactions 
of proteins. The diversity of potentially reactive functional groups 
present in biomolecules (for example, amides, acids, alcohols, amines) 
combined with the requirement for fast kinetics and mild reaction 
conditions (for example, aqueous solvent, pH 6-8, temperature 
T < 37°C) make challenging the development of new techniques to 
functionalize proteins. Nevertheless, methods have emerged for bio- 
conjugation with natural and unnatural amino acids in protein mol- 
ecules®’. Cysteine is a key residue for the chemical modification of 
proteins owing to the unique reactivity of the thiol functional group 
and the low abundance of cysteine residues in naturally occurring 
proteins*”. Michael addition to maleimides and $2 reaction with alkyl 
halides are commonly used for cysteine modification. The resulting 
conjugates tend to decompose in the presence of external bases or thiol 
nucleophiles’’, which prompted the recent development of advanced 
cysteine bioconjugations for the improved stability of conjugates (see 
ref. 11, and references therein). 

The ability to achieve high levels of chemo- and regioselectivity 
through the judicious choice of metal and ligand design suggest that 
metal-mediated processes could be very attractive for the development 
of new bioconjugations. Existing metal-based transformations often 
rely on the use of functional handles’? or unnatural amino acids, such 
as 4-iodophenylalanine and aldehyde- or alkyne-containing amino 
acids**"*, and require high concentrations (mM) of derivatizing 
agents, which can cause off-target reactivity or purification problems. 
We considered that palladium complexes resulting from the oxidative 
addition of aryl halides or trifluoromethanesulfonates"* could be used 


for the transfer of aryl groups to cysteine residues in proteins (Fig. la). 
(For existing transition-metal-catalysed C-S bond-forming reactions, 
see ref. 15.) The efficiency and selectivity of the proposed reaction with 
the highly active palladium species may be hampered by the presence 
of a variety of functional groups within complex biomolecules. 
Furthermore, the presence of free thiols has been previously shown 
to inhibit palladium-catalysed cross-coupling reactions on peptides"®, 
while Pd(11)-complexes have also been shown to exhibit protease-like 
behaviour with certain peptides’’. However, we envisioned that the 
careful choice of ligand would provide stable, yet highly reactive 
reagents for the desired transformations (Fig. 1b), while the interaction 
between the soft nucleophile cysteine thiol and the aryl palladium() 
species would guide its selectivity. 

We began our study with a palladium-tolyl complex (1A-OTf; 
Fig. 1c) using 2-dicyclohexylphosphino-2’,6’-diisopropoxybiphenyl 
(RuPhos) as the ligand and trifluoromethanesulfonate as the counter- 
ion. A model peptide (P1; Fig. 1c) was used for the optimization of the 
reaction conditions and for the exploration of the substrate scope. Full 
conversion of the starting peptide to the corresponding aryl product 
was observed in less than 5 min at low micromolar concentrations of 
reagents (Fig. 1c). Further, the reaction was selective for cysteine. No 
reaction was observed using a control peptide with cysteine mutated 
to serine (Supplementary Information), in contrast to the palladium- 
mediated protein allylation, which is selective for tyrosine (O-allyla- 
tion) over lysine and cysteine (N- and S-allylation). (The new 
aryl-palladium reagents are less electrophilic than the allylpalladium 
species used in ref. 18, which tunes their selectivity towards cysteine 
residues while making them completely unreactive towards alcohol- 
based species like tyrosine.) These results highlight the importance of 
choosing the right ligand, which facilitates C-S reductive elimination 
and, together with the overall electrophilicity of the palladium centre, 
tunes the selectivity of the transformation. 

Most cysteine conjugation reactions operate at nearly neutral to 
slightly basic pH. Further evaluation of the reaction conditions using 
palladium reagents revealed quantitative conversion of the starting 
peptide to the corresponding S-aryl cysteine conjugate within a broad 
pH range (5.5-8.5) using common organic co-solvents (5% of 
N,N-dimethylformamide (DMF), dimethylsulfoxide (DMSO), aceto- 
nitrile (CH3;CN)) in various buffers (Supplementary Information, 
Supplementary Table 1). Remarkably, even in 0.1% trifluoroacetic acid 
(TFA) solution (pH 2.0) the reaction yielded 59% of the S-arylated 
product after 7h. The process was also compatible with the protein 
disulfide reducing agent tris(2-carboxyethyl)phosphine (TCEP) that 
has been shown to hamper bioconjugations by reacting with malei- 
mide and «-haloacyl groups”. 

The palladium-mediated conjugation is fast, and complete product 
formation occurs within 15s at 4°C. The reaction rate was estimated 
by competition experiments against the commonly used N-methyl 
maleimide cysteine ligation. (The reported”® kinetics for maleimide 
conjugation are in the range 10°-10* M_'s ' at pH_7.5.) At pH 7.5 
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Figure 1 | Organometallic palladium reagents for cysteine modification. 
Figure shows general strategy and model studies. a, Proposed cysteine 
bioconjugation using palladium reagents. b, X-ray structure, storage and 
operational simplicity of palladium reagents for cysteine bioconjugation. Left, 
crystals of LA-OTf for X-ray diffraction were obtained by vapour diffusion 
of a diethyl ether/acetonitrile solution of 1A-OTf with pentane. Thermal 
ellipsoid plots are drawn at 50% probability, hydrogen atoms are omitted 

for clarity. Middle, general structure of palladium reagents. RuPhos, 
2-dicyclohexylphosphino-2',6'-diisopropoxybiphenyl. Right, the complexes 
were stored in screw-cap vials under air and weighed out on the bench top. 


the rate of the palladium-mediated reaction was comparable to that of 
the maleimide ligation, where 70% of the products resulted from the 
reaction with palladium-tolyl complex (1A-OTf). Notably, the pal- 
ladium-mediated conjugation outperformed the maleimide ligation at 
pH 5.5 where only the arylated product was formed. 

The optimized conditions (0.1 M Tris buffer, 5% CH3CN, pH 7.5, 
room temperature) were used for further evaluation of the substrate 
scope. Palladium complexes containing chloride, bromide and iodide 
counterions were all found to produce the desired product (Fig. 2, 1A- 
Cl, 1A-Br and 1A-I). This method can be used to functionalize 
unprotected peptides with a variety of important groups, including 
fluorescent tags (1C, 1D), affinity labels (1E), bioconjugation handles 
(aldehyde 1F, ketone 1G and alkyne 1H) and photochemical cross- 
linkers (11), as well as complex drug molecules (1J). Vinyl palladium 
complexes were also shown to be competent in this transformation 
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c, Model reaction with a peptide substrate (top) and the liquid chromatography 
mass spectrometry (LC-MS) trace of the crude reaction mixture after 5 min 
(bottom). The mass spectrum of the arylated product is shown in the inset. 
Sequence of peptide P1: NH;-RSNFYLGCAGLAHDKAT-C(O)NH>. The 
reaction was quenched by the addition of 3-mercaptopropionic acid (3 equiv. 
to 1A-OTf) before LC-MS analysis. At high reaction concentrations 

(2100 uM) a cloudy precipitate formed after the addition of palladium reagent 
presumably owing to low solubility of the complex in the aqueous solvent. 
These reactions still produced the desired bioconjugate in high yields 
(Supplementary Information). 


(Supplementary Information). Importantly, the palladium(m) com- 
plexes are stable under ambient conditions, and can be stored in closed 
vials under air at 4 °C for over four months. Long-term stability of 1A- 
I, 1A-Br, 1A-Cl and 1A-OTf was evaluated; only the complex bear- 
ing the trifluoromethanesulfonate counterion (1LA-OTf) showed some 
degradation (=15%) after 20 weeks (Supplementary Information). 
Nevertheless, the ‘aged’ reagent still exhibited reactivity comparable 
to the freshly made complex. 

The stability of our arylated peptides was compared to that of con- 
jugates formed from reactions with reagents including N-ethyl malei- 
mide, 2-bromoacetamide and benzyl bromide. The S-arylated peptide 
was shown to be stable towards acids, bases and external thiol nucleo- 
philes (Supplementary Information). In contrast, the corresponding 
acetamide derivative was unstable under acidic and basic conditions, 
and the maleimide conjugate decomposed in the presence of base and 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


0.1 M Tris (pH 7.5) 
CH,CN:H,0, 5:95 


Pg—-Ar  ———_______> H,N—~ T “—C(O)NH 
5 min 
x 


HS Pi then SAr 
10 uM 20-60 uM HS \- CO2H Quantitative 
"4 : > 
' Fluorescent tags ) ‘ Affinity tag ) 
O OH 
acce Peeg ha ee ‘ 
Oo * Z | HN omy 
OO : : 
; roy Ss 
H oO : 
; 1c 1D Le 
1A-CLX=Cl =! X = OTf X = OTf ‘ A= Ol 
1A-Br, X = Br : ‘ 
4A-I, X =| ' Bioconjugation handles ! Drug molecules ) 
MeN oe 
mt O N 
ie oe aD 1J "| 
Me~ ~N* Et : ; X= Br MeO 7 
' H Vandetanib 
t ' NH 
1B ‘ 1F, 1G, 1H 
7 : X=Cl il ‘(kinase inhibitor 
A= Ol : R=H, Me, Ph X=Cl : of cell receptors) “ F 
“ ' H 


Figure 2 | The substrate scope of cysteine arylation using organometallic 
palladium reagents. Top, the reaction studied. Bottom, a selection of 
palladium reagents was used to test the effect of the leaving group (X) on the 
reactivity and explore the substrate scope with regard to biologically relevant 


exogenous thiol. Finally, comparable stability of both aryl and benzyl 
conjugates to treatment with the periodic acid oxidant at 37 °C was 
observed. However, additional tuning of the electronic properties of 
the aromatic ring of the arylated peptide could be achieved by instal- 
ling an electron-withdrawing cyano group in the para position. This 
modification significantly decreased the amount of oxidation, pro- 
ducing the most stable peptides across all the evaluated conjugates. 


groups (fluorescent tags, bioconjugation handles, affinity tag and a drug 
molecule). Full conversion of starting peptide P1 into the corresponding 
arylated products was observed in all the cases shown, as confirmed by LC-MS. 
For exact reaction procedures and conditions, see Supplementary Information. 


Notably, installing the para cyano group in the benzyl conjugates had 
no effect on oxidation (Supplementary Information). 

We further explored this reaction with proteins. Three antibody 
mimetic proteins’ (P4—P6; Fig. 3) were expressed that contained a 
cysteine at structurally distinct positions including the amino- 
terminus, the carboxy-terminus and a loop. The same proteins without 
cysteines were used as controls to confirm the selectivity of the reaction 
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Figure 3 | Protein modification using the developed palladium reagents. 
Cysteine residues at the N terminus (P4; a), a loop (P5; b) and the C terminus 
(P6; c) of proteins are quantitatively modified with coumarin after the reaction 
with palladium complex 1D (top). Deconvoluted mass spectra of the full 
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protein peaks are shown for the starting proteins (P4—P6) and reactions with 
coumarin-palladium complexes after 30 min (bottom). Three-dimensional 
structures of proteins P4—P6 and the arylated products (P4-D, P5-D and 
P6-D) are presented next to the corresponding mass spectra. 


29 OCTOBER 2015 | VOL 526 | NATURE | 689 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


HS 


10 uM 
— Stapling reagents R 
0.5 mM Tris (pH = 7.5) 
CH,CN:H,O, 1:1 
10 min 
then 
HS ~_ CO3H 
2 


ie) 


ae 
za 


— 
S) iS} 
H,N-—lle-Lys-Phe-Thr-Asn— pe eee \ w 


Figure 4 | Peptide stapling and antibody drug conjugate formation using 
palladium-based reagents. a, Stapling of a model peptide using bis-palladium 
reagent 2A. Reaction conditions: P3 (10 1M), complex 2A (20 |1M), Tris buffer 
(50 mM; pH 7.5), CH3CN:H20 = 1:1, room temperature, 10 min. The reaction 
was quenched by the addition of 3-mercaptopropionic acid (6 equiv. to 

2A) before the LC-MS analysis. b, Synthesis of ‘linker-free’ antibody—drug 


(P7-P9). All three proteins (P4—P6) were quantitatively tagged with 
either coumarin (Fig. 3) or a drug molecule (Supplementary 
Information) within 30 min at 1 1M protein concentration. No ary- 
lated product was generated for proteins lacking a cysteine. The pres- 
ence of small amounts of organic co-solvents is required for efficient 
bioconjugation, and lower product yields were observed when <5% of 
the co-solvent was used (Supplementary Information). The fast kin- 
etics and high efficiency of the reactions at low micromolar and low 
nanomolar (Supplementary Information) protein concentrations are 
in contrast to reported bioconjugation methods using organometallic 
reagents, where longer reaction times were needed and generally lower 
conversions were observed’. 

The developed protocol was used to arylate an engineered cysteine 
residue in the C-terminal region of diphtheria toxin A-chain (DTA) 
fused to the lethal factor N-terminal domain (LFy-DTA-Cys, see 
Supplementary Information)”. The modified LFy-DTA-Cys variant 
was readily separated from the remaining palladium species, ligands 
and other small molecules using commercially available size-exclusion 
chromatography columns (91% of palladium was removed, as deter- 
mined by inductively coupled plasma mass spectrometry (ICP-MS) 
analysis of the purified protein sample, see Supplementary 
Information). The modified and purified LFy-DTA-Cys variant dis- 
played similar activity (half-maximal effective concentration 
ECs9 = 0.40 + 0.09 nM) in a cell-based protein synthesis inhibition 
assay compared to the control (a serine mutant LFy-DTA-Ser showed 
ECs9 = 0.25 + 0.05 nM) (Supplementary Information). 

Stapled peptides have shown significant promise as next-generation 
therapeutics™*”°. However, there are limited methods for the synthesis 
of these bioconjugates with structurally diverse linkers*®, which hinders 
the systematic investigation of the effect of the linker on the properties 
of the stapled peptides”’. We envisioned that palladium reagents con- 
taining two electrophilic metal centres could be efficiently used to cross- 
link two cysteine residues on a peptide chain, thereby providing access 
to stapled peptides with various aryl linkers (Fig. 4a). Indeed, running 
the reaction at 10 1M concentration of peptide in a 1:1 (v/v) acetoni- 
trile/water solvent mixture at pH 7.5 using a twofold excess of the 
bis-palladium complex 2A resulted in quantitative formation of the 
target stapled peptide within 10 min (Fig. 3a and Supplementary 
Information). Considering the availability of commercially or other- 
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conjugates (ADCs). Trastuzumab antibody is represented as the two-tone grey 
structure at the top. Deconvoluted mass spectra of the fully reduced and 
deglycosylated ADCs are shown. Drug-to-antibody ratio (DAR) represents the 
average number of drugs per antibody. See Supplementary Information for 
details. 


wise easily accessible diarylhalide reagents, this approach provides 
facile access to a diverse aryl-linker space for stapled peptides”. 

Antibody-drug conjugates (ADCs) are a promising class of biother- 
apeutics, which combine the potency of cytotoxic drugs with the target 
specificity of monoclonal antibodies”. We aimed to attach drug mol- 
ecules directly to cysteine residues in antibodies through the developed 
palladium conjugation chemistry. The drug payload vandetanib was 
used to form palladium complex 1J (Fig. 2) by making use of the aryl 
bromide present in its structure. Treating partially reduced trastuzumab 
antibody” with 1J readily produced ADCs with a 4.4 drug to antibody 
ratio (DAR; Fig. 4b). The purified arylated ADCs (94% of palladium was 
removed, as determined by ICP-MS analysis of the purified ADCs, see 
Supplementary Information) retained binding affinity (dissociation con- 
stant Kp =0.3+0.2nM) to recombinant HER2 compared to the 
unmodified trastuzumab antibody (Supplementary Information). 
While traditional ADCs use various linkers to attach drug molecules 
to antibodies, our method significantly expands the structural space of 
ADCs by providing the capability to directly attach drug molecules 
containing native or pre-installed aryl halide or phenol functional 
groups. The therapeutic potential of this class of ‘linker-free’ ADCs will 
be investigated in the future. 

We note that the ease of preparation, storage and use of the present 
palladium reagents make them particularly attractive for routine 
application in chemistry, biology, medicine and materials science. 
Further evolution of the metals and ligands employed will probably 
provide an extended set of organometallic bioconjugation reagents 
with altered selectivity and efficiency, allowing for functionalization 
of other amino acid residues. 
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Decadal slowdown of a land-terminating sector 
of the Greenland Ice Sheet despite warming 


Andrew J. Tedstone', Peter W. Nienow!, Noel Gourmelen!, Amaury Dehecq!?, Daniel Goldberg’ & Edward Hanna? 


Ice flow along land-terminating margins of the Greenland Ice 
Sheet (GIS) varies considerably in response to fluctuating inputs 
of surface meltwater to the bed of the ice sheet. Such inputs lub- 
ricate the ice-bed interface, transiently speeding up the flow of 
ice’”. Greater melting results in faster ice motion during summer, 
but slower motion over the subsequent winter, owing to the evolu- 
tion of an efficient drainage system that enables water to drain 
from regions of the ice-sheet bed that have a high basal water 
pressure**. However, the impact of hydrodynamic coupling on 
ice motion over decadal timescales remains poorly constrained. 
Here we show that annual ice motion across an 8,000-km? land- 
terminating region of the west GIS margin, extending to 1,100 m 
above sea level, was 12% slower in 2007-14 compared with 1985- 
94, despite a 50% increase in surface meltwater production. Our 
findings suggest that, over these three decades, hydrodynamic 
coupling in this section of the ablation zone resulted in a net slow- 
down of ice motion (not a speed-up, as previously postulated’). 
Increases in meltwater production from projected climate warm- 
ing may therefore further reduce the motion of land-terminating 
margins of the GIS. Our findings suggest that these sectors of the 
ice sheet are more resilient to the dynamic impacts of enhanced 
meltwater production than previously thought. 

The GIS is losing mass at an accelerating rate*’, as a result both 
of increased surface melting® and of enhanced ice discharge from 
accelerating marine-terminating glaciers’. Enhanced melting accounts 
for about 60% of the GIS mass lost since 2000 (ref. 5); summer 
air temperatures over the southwest GIS warmed by 0.9°C during 
1994-2007 (ref. 7), and the amount of meltwater produced during 
the summers of 2007-12 (except 2009) is without precedent in the 
past 50 years of reanalysis-forced reconstructions®. During 1993-2012, 
the average annual melt doubled from that which occurred during 
1961-90 (ref. 8). 

While the acceleration of marine-terminating glaciers is believed to 
be driven primarily by processes operating at the ice—ocean interface, 
atmospheric forcing can also change ice motion at both land- and 
marine-terminating glaciers through the delivery of surface meltwater 
to the ice-sheet bed’”. Surface meltwaters can drain rapidly to the ice- 
sheet bed via moulins and supraglacial lake drainage events, both of 
which provide direct surface-to-bed connectivity and a mechanism by 
which surface meltwater could influence basal motion’ ’. It has been 
suggested that this mechanism could lead to a positive feedback 
between enhanced surface meltwater production and ice-sheet 
motion, as ice would move more quickly to lower elevations, where 
temperatures are warmer”. 

Other studies have highlighted the importance of the subglacial 
drainage system in controlling the relationship between surface 
melting and ice motion through changes in system capacity and 
morphology’*”*. During summer, rapid increases in meltwater from 
the ice-sheet surface result in periods in which the subglacial drainage 
system is more highly pressurized, leading to transient periods when 
the water pressure below exceeds the pressure caused by the weight of 


the ice above, resulting in enhanced basal sliding’. However, the capa- 
city of the subglacial drainage system then increases in response'*””, 
introducing a negative feedback that lowers the water pressure and 
reduces basal sliding’*'*. By the end of summer, an efficient drainage 
system has evolved upglacier'*"*, draining surrounding regions of the 
ice-sheet bed that were previously hydraulically isolated. This reduces 
basal lubrication during the subsequent winter, counteracting the 
summer speed-up and making net annual ice motion relatively insens- 
itive to summer melting””. 

Despite these advances in understanding the coupled hydrodyn- 
amics of ice-sheet flow, it remains unclear whether enhanced surface 
melting has a long-term impact on annual ice motion. Eight global 
positioning system (GPS) stations on a transect extending 130km 
inland in the southwest GIS showed an average 10% decrease in ice 
flow from 1991 to 2007, during a period in which surface melt 
increased markedly, but there was considerable spatial variability’. 
The slowdown trend at lower sites continued into 2012 (ref. 20). 
Meanwhile, from 2009 to 2012 a small acceleration signal was observed 
above the altitude of the equilibrium line (where annual ice accumula- 
tion is equal to annual ablation), at ~1,500 metres above sea level 
(m.a.s.L; ref. 21). The parametrization of basal lubrication in higher- 
order ice-sheet models, using observations from southwest Greenland, 
suggests that basal lubrication is unlikely to increase the contribution 
of the ice sheet to sea-level rise by more than 5% of the contribution 
that would be expected from a negative surface mass budget alone, and 
could conceivably act as a negative feedback upon ice motion”. 

Here we present observations of annual GIS motion spanning three 
decades, which extend back to 1985. Our ~8,000-km” study area 
extends along about 170 km of the predominantly land-terminating 
margin of the west GIS, to ~50km inland, and to ~1,100 mas. 
(Fig. 1). We apply feature tracking (see Methods) to 475 pairs of 
remotely sensed optical Landsat images separated by approximately 
one year’*. Next, we derive robust ice-motion and uncertainty esti- 
mates over periods of approximately one to two years from 1985 to 
2014 (Fig. 2b and Extended Data Fig. 4), and over multiyear reference 
periods spanning, first 1985-94, capturing the period before air tem- 
peratures began to warm’; and second 2007-14, corresponding to the 
recent series of summers with record melting’. 

Ice motion shows a clear regional slowdown (Fig. 1), with 84% of the 
study area flowing more slowly in 2007-14 than in 1985-94 (Fig. 1a). 
On average, ice motion slowed by 12% across the study area. 
Slowdown was strongest (~ 15-20%) at elevations below around 800 
m.a.s.l. (Fig. 1b). Isolated areas experienced speed-up in 2007-14 com- 
pared with 1985-94. In the far northeast, the speed-up can probably be 
attributed to the dynamics of the neighbouring marine-terminating 
Jakobshavn Isbrae, which, like many of Greenland’s marine-termin- 
ating glaciers, has accelerated since the mid-1990s (ref. 5). 

We can divide the ice-motion record (Fig. 2b) into two statistically 
significant periods (see Methods). Segmented linear regression 
(R* = 0.79) shows that there was no significant trend in ice motion 
during 1985-2002 (P= 0.85). The slowdown in motion probably 
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Figure 1 | Study area in the ablation zone of the western GIS. In the main 
figure, the colour scale shows the percentage change in ice velocities during the 
2007-14 reference period compared with the 1985-94 reference period (see 
main text). Transects A, B and C correspond to data in Fig. 3. Ice surface 
contours (pale grey lines) are from ref. 30. Grey shading denotes areas where ice 
velocities cannot be resolved; green shading denotes land areas; light blue 
shading denotes inland and coastal waters. Inset: a, percentage changes in ice 
velocities in 4% bins; b, median percentage change in each 100-m elevation 
band between 400 m.a.s.l. and 1,100 m.a.s.l., +1o (see Methods). 


began around 2002, and, despite interannual variability, there was a 
robust overall trend of —1.5myr * during 2002-14 (P<0.01). 
Meanwhile, we can divide surface meltwater production (Fig. 2a) into 
three statistically significant periods (see Methods): a sustained ‘low 
melt of 2.1 water equivalent (w.e.) myr * during 1985-93; a rising 
melt during 1993-2002; and a sustained high melt of 3.2 w.e. myr 
during 2002-14, coincident with the slowdown in ice motion. Overall 
there was a 49.8% rise in surface meltwater production across our 
study area between 1985-94 and 2007-14. 

We explored temporal variability in ice motion along three transects 
(Fig. 1), selected to represent different ice-marginal conditions. 
Transect A extends 80 km inland from the Nordenskjéld glacier, which 
has a lacustrine-terminating margin; transect B extends about 30km 
inland from a land-terminating margin; and transect C extends about 
30km inland from the marine-terminating Alangordliup sermia. 
Transects A and B slowed down during 2000-14 to attain velocities, 
averaged along the transect, that were respectively 19% and 18% slower 
in 2013-14 than during 1985-94 (Fig. 3a, b). Ice-motion characteristics 
at the marine-terminating transect C were more complex (Fig. 3c). The 
transect slowed on average from the mid-2000s to 2014, although ice 
motion within 10 km of the margin sped up in the late 2000s following 
earlier slowdown, and by 2013-14 was flowing up to about 50m yr * 
faster than during the 1985-94 reference period. Such behaviour is in 
line with other tidewater glaciers that have recently accelerated’. 
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Figure 2 | Surface melting and ice motion averaged over the study area. 

a, Annual mean modelled surface melt (grey), smoothed with a five-year 
moving mean (black), both in water equivalent (w.e.) m per year (see Methods). 
b, Median ice velocities during each period (black boxes) calculated using the 
common sampling pixels across the time series, +10 (see Methods). The width 
of each box corresponds to the total timespan of the pairs of Landsat images 
acquired during each period. The height of each box corresponds to + 1o (see 
Methods). Blue and red lines illustrate the trends in ice velocity computed by 
segmented linear regression weighted by lo. ¢, Altitudinal distribution of the 
common sampling pixels used to compute the velocities in b. 


The slowdown signal across our predominantly land-terminating 
region extends up to about 1,100 m.a.s.l. (Fig. 1), where the mean ice 
thickness is roughly 850m (ref. 24). The clear deceleration in ice 
motion requires a decrease in rates of either internal ice deformation, 
or basal motion, or both. Melting has caused marginal thinning of the 
GIS***’, During 1993-98, land-terminating glaciers on the west GIS 
margin thinned by 0.02-0.23 myr ' below 1,000 m.a.s1. (ref. 25). Our 
study area thinned by about 0.2myr | during 2003-07 (ref. 26) and 
this rate increased to 1-1.5myr_‘ during 2011-14 (ref. 27). We mod- 
elled the velocity change that would be caused by 10-20m of ice 
thinning (and the associated gradient changes) along transect A over 
the 1985-2014 study period (see Methods), corresponding to a max- 
imum thinning rate of about 0.6 m yr. The resulting change in driv- 
ing stress can explain only around 17-33% of the observed overall 12% 
slowdown signal beyond 10 km from the ice-sheet margin, and can 
explain none of the slowdown beyond 50km from the margin 
(Extended Data Fig. 5c). Thus, while a component of the observed 
slowdown can be explained by changes in driving stress through 
ice thinning, the majority of the slowdown (that is, the remaining 
67-83%) must be the result of processes operating at the ice-bed 
interface that cause a reduction in basal motion. 

Previous studies have suggested that the coupling between surface 
melting and basal motion is self-regulating, such that there is no stat- 
istically significant relationship between melting and ice motion over 
annual timescales*”®. In agreement with these studies, we find no 
relationship between annual melt volume and annual ice motion 
(R? = 0.08). There is, however, a significant relationship between ante- 
cedent melt volumes and ice motion (Extended Data Table 1). The 
mean melt volume from each observation period and the previous year 
combined explain 23% of ice motion (P< 0.05), increasing to 44% 
when the previous four years of melt are included. Moreover, melt 
volumes explain 50% of ice motion when the mean melt volume is 
calculated using only the previous three years’ data (P< 0.01). 

We therefore suggest that sustained high production of surface 
meltwater (Fig. 2a) is responsible for the sowdown. Observations from 
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Figure 3 | Ice velocities along three transects in the study area. Results for 
a, transect A; b, transect B; c, transect C. These transects are shown in Fig. 1. 
Only periods in which ice velocities are observed along at least 60% of each 
transect are shown. Velocities during the 1985-94 reference period are also 
shown. 


the GIS show that during the melt season, the large cumulative increase 
in the rate of meltwater supply to the ice-sheet bed results in the 
expansion of a channelized subglacial drainage system’®, even beneath 
ice that is around 1 km thick"’. As air temperatures warm, more melt- 
water at higher elevations allows an efficient drainage system to evolve, 
which extends further into the ice sheet. Summers of extreme melt 
result in a higher-capacity, more-extensive channelized drainage sys- 
tem, which therefore stays open at atmospheric pressure for longer 
after the melt ceases**. Dye tracing of alpine glaciers” indicates that 
water transit speeds through unchannelized drainage systems are 
~0.01ms ' (~850md_'). Thus, while channels at atmospheric 
pressure beneath ice that is about 1km thick close within hours to 
days"', channels that stay open for longer, for example for just two days 
as opposed to one, have the capacity to evacuate notably more water 
from surrounding linked but unchannelized regions of the ice-sheet 
bed, causing more widespread dewatering of the ice-bed interface. 
Sources of meltwater, including frictional melting by basal slip and 
geothermal heat, will enable the water pressure to recover gradually 
through the subsequent winter, but may be insufficient to replace the 
stored waters evacuated during the previous melt season. 

Previous observations have illustrated the importance of changes in 
the connectivity between channelized and unchannelized regions of 
the ice-sheet bed in controlling ice velocities late in the melt season’’. 
We postulate that unchannelized drainage regions and their connec- 
tivity to the channelized drainage system govern ice motion not only 
late in the melt season, but also during the following winter and spring. 
We suggest that, if increases in drainage efficiency occur year-on-year, 
gradual net drainage of water stored in unchannelized regions of 
the ice-sheet bed will result in reduced basal lubrication and net ice 
slowdown. Additional field observations—such as borehole arrays, 
transverse to subglacial channels, recording water pressure gradients 
(see, for example, ref. 29), together with hydrological modelling 
(see, for example, ref. 14)—are required to test the robustness of our 
hypothesis. Furthermore, while melt-driven seasonal evolution in sub- 
glacial drainage can affect the flow of tidewater glaciers’, the ongoing 
acceleration of these glaciers’ during a period of warming, in contrast 
to our observations, suggests that other processes are controlling 
their dynamics. 
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Our observations of GIS ice motion from three decades provide 
conclusive evidence that a 50% rise in meltwater production has not 
led to ice speed-up along a land-terminating margin; instead, average 
annual ice motion slowed by more than 15% at elevations below 800 
m.a.s.l, and probably by at least 5% at elevations up to 1,100 m.a.s.l. 
Only about 17-33% of the slowdown can be explained by reduced 
internal deformation caused by ice thinning, and we therefore propose 
that, since 2002, increases in subglacial drainage efficiency associated 
with sustained larger melt volumes have reduced basal lubrication, 
resulting in slower ice flow. It remains unclear whether the observed 
slowdown occurs at elevations above 1,100 m.a.s.l., and whether the 
slowdown will migrate inland as enhanced melting extends to higher 
elevations and allows a more extensive efficient subglacial drainage 
system to evolve. Furthermore, our findings relate to land-terminating 
margins, but the forcing mechanisms that have driven the recent 
speed-up of many tidewater glaciers remain poorly understood*”® 
and require a similar examination of annual ice motion over decadal 
timescales. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Remote sensing of ice motion. We applied feature-tracking techniques to 
extract ice motion from Landsat Program imagery. Landsat images were obtained 
from the US Geological Survey (via the Earth Explorer catalogue at http:// 
earthexplorer.usgs.gov) and the European Space Agency (via the Earth 
Observation Link catalogue at https://earth.esa.int/web/guest/eoli). Here we pro- 
vide an outline of the processing strategy and the specific parameters that we used. 
A detailed description of the processing strategy is available elsewhere”’. Our 
approach builds individual annual velocity fields from feature tracking of 
Landsat pairs, overlapping in time and space. These velocity fields are then com- 
bined over interannual time periods in order to increase the robustness of the 
velocity estimates and to enable statistical determination of uncertainties. 

We used images from the Landsat 5, 7 and 8 missions; the quality and quantity 
of images from Landsat missions 1-3 were insufficient to permit their use. We 
identified 475 image pairs with temporal baselines between 352 and 400 days, 
acquired from April to October over the 1985-2014 study period (see 
Supplementary Information). The temporal baseline of ~1 year was chosen to 
minimize the impact of seasonal flow variability upon interannual trends in ice 
velocity, as we are specifically interested in long-term changes in ice motion. 

To enhance the images before feature tracking, we used principal component 
analysis to combine the optimum spectral bands, identified during testing as bands 
2 and 3 for each satellite mission. A high-pass filter (using Sobel kernels) was used 
to compute the intensity gradients of each image, enhancing surface features such 
as crevasses and reducing the impact of features related to basal topography, which 
by definition are temporally stable. We then used the gradients as features to be 
tracked. The tracking (used to extract ice displacement) was performed on match- 
ing windows of 44 pixels (1,320 m) anda grid spacing of 8 pixels (240 m), while the 
search window was set automatically to correspond to the maximum expected 
displacement over the baseline duration between the two images, based on pre- 
vious velocity observations*'. 

The processing strategy exploits the redundancy offered by multiple, spatio- 
temporally overlapping pairs to remove velocity outliers efficiently and produce 
robust velocity fields. First, we filter low-quality velocity estimates by applying a 
threshold to the signal-to-noise ratio returned during the feature tracking. The 
threshold value was determined by examining all velocity pairs to identify the 
value beyond which the median absolute deviation (MAD) of stable area velocities 
becomes asymptotic. We use a median-based approach to minimize the impact of 
outliers, and because the distribution of velocity tends not to follow a normal 
distribution’’. Next, for the period 2000-14, we group velocities into one-year 
time periods and for the period 1985-2000, we group velocities into two-year time 
periods (because there were fewer Landsat pairs available). This provides spatio- 
temporal redundancy in the velocity estimates at each pixel and enables us to 
quantify uncertainties. To produce the final velocity field for each period, we 
compute the median of all the available velocity estimates at each pixel. Lists of 
the Landsat pairs that contribute to each period are available in the Supplementary 
Information. Finally, we compute the lo uncertainty of the velocity estimate at 
each pixel in each period. To do so we fit a law of the form: 


kMAD 
0-5 (1) 
2 N* 


where N is the number of velocities used to compute the median velocity, o is the 
lo confidence interval, and k and « are the parameters to be determined. k and « 
were determined for each time period from the velocity estimates made over stable 
land areas whose true value is known and equal to 0. Uncertainty of the final ice 
velocity of each pixel at each time period is then obtained by extrapolating this 
relationship to on-ice areas with the appropriate values of MAD and N for a given 
pixel. Considering N and MAD at the pixel level allows the surface characteristics 
at the location of each pixel (for example, surface conditions or variability of 
velocity during the time period considered) to be taken into account. Pixels with 
o> 60myr ’ are discarded in the subsequent analysis. 

We computed the percentage change in ice velocities between 1985 and 1994, 
and between 2007 and 2014, at all of the pixels that are common to both periods 
(Fig. 1); we then computed the median percentage change, both over the 
whole study area and in elevation bands of 100 m.a.s.l.. For each elevation band 
in Fig. 1b, we calculated the uncertainty of the percentage change by first estim- 
ating the uncertainty of the 1985-94 (e,) and 2007-14 (e2) velocities separately as 


\/Sc%_, o2/N, and then computing \/e:2+e?. Residual striping patterns in 


Fig. 1 are caused by lines of missing data in the Landsat 5 imagery. 

To compute interannual median velocities (Fig. 2b), we discarded first periods 
in which less than 30% of the study area has observations. Then, the pixels com- 
mon to all the retained periods (Extended Data Fig. 4) were selected so as to avoid 
temporal variation caused by spatial bias. For each period we calculated the med- 


ian of the 8,025 temporally common pixels and the associated uncertainty o, 


estimated as ,/ 37, o?/N. The altitudinal distribution of the common sampling 


pixels is shown in Fig. 2c. 

The decision to discard periods in which less than 30% of the study area has 

observations is a compromise between calculating the median velocity of each 
period using the greatest possible number of pixels common to all periods, versus 
retaining the maximum possible temporal resolution. We examined the sensitivity 
of the ice-motion time series (Fig. 2b) to 40% and 50% thresholds. At 40%, there 
are 5,970 more sampling pixels in common than at 30%, but temporal resolution 
decreases as ice velocities observed in 2003-06 are no longer retained. The R’ of the 
two-trend model decreases to 0.65. The rate of slowdown during 2002-14 
increases from 1.5 myr * to 1.9 myr (P<0.01). At 50%, there are 8,397 more 
sampling pixels in common than at 30%. Velocities observed during the 1995-97 
period are also discarded. The R* of the two-trend model is 0.63. The rate of 
slowdown during 2002-14 remains the same as at 40% (P<0.01). There is no 
statistically significant trend in ice motion from 1985 to 2002 (P > 0.05) at either of 
the tested thresholds. From this sensitivity analysis, we conclude that our 30% 
threshold case yields the highest temporal resolution and also the most conser- 
vative trend in ice motion during 2002 to 2014. 
Identification of trends in melting and ice motion. For each time series, we test 
whether it can be divided into temporally distinct populations separated by break 
dates. We apply the Mann-Whitney-Wilcoxon (MWW) rank sum test to invest- 
igate the effect of prescribing different break dates. We chose the MWW test over 
the t-test as we do not know whether the data set follows a normal distribution. 
The test computes the probability that the populations, separated by prescribed 
break dates, are similar. 

The melt time series (Fig. 2a) consists of several years of sustained low melt, 
followed by a period of rising melt and then several years of sustained higher melt. 
We therefore test whether the melt time series can be split into three statistically 
different populations, with two break dates separating the periods of sustained low, 
rising and sustained high melt. We find that the break-date combinations of 
(i) 1992 and 2001, (ii) 1993 and 2001, and (iii) 1993 and 2002 are all significant 
with 95% confidence (Extended Data Fig. 2). To find the best possible combination 
of break dates, we compute the root-mean-squared error, or residuals, of the best- 
fitting three-trend segmented linear regression model (Extended Data Fig. 2). We 
observe the lowest residuals for break dates of 1991-93 and 2001-04. The com- 
bination of break dates that satisfies the MWW test and has the lowest residuals is 
1993 and 2002. With this chosen combination of break dates, the probability that 
(i) 1985-93 is similar to 2002-13 is 0.02%; (ii) 1985-93 is similar to 1994-2001 is 
2%; and (iii) 1994-2001 is similar to 2002-13 is 3%. 

For the ice-motion time series, we first test whether it can be divided into two 
temporally distinct populations, in order to justify the use of segmented linear 
regression (Fig. 2). We apply the MWW test as previously. We find that for break 
dates beyond mid-2001, the null hypothesis (equal median) can be rejected 
(Extended Data Fig. 3a), meaning that the pre- and post-2001 populations are 
statistically different with 95% confidence. To test the ability of two distinct per- 
iods separated by a given break date to represent the velocity time series, we then 
compute the residuals of the best fitting two-trend segmented linear regression 
model for a set of break dates spanning the time period of the data set (Extended 
Data Fig. 3b). We observe a minimum for a break date in 2002, but with a region of 
low residuals spanning the period 1998 to 2004. We conclude from these tests that 
there are two distinct temporal populations of ice motion in our data set and that 
the break point occurs during the period 1998 to 2004. For our analysis (Fig. 2) we 
select a break date of 2002, which corresponds both with the lowest residuals and 
with the MWW test that suggests that the pre- and post-2002 populations are 
statistically different. 

We then examine whether there is a statistically significant relationship between 

meltwater production and ice motion by applying linear regression analysis to 
investigate the extent to which variability in ice motion can be explained by, first, 
temporally coincident, and second, antecedent meltwater production (Extended 
Data Table 1). We quantify antecedent meltwater production in two different 
ways. In one scenario we calculate the mean during the period of observed ice 
motion and the preceding N years. In the other scenario we calculate the mean of 
only the preceding N years. 
Impact of varying baseline durations on annual velocity. Images separated by 
baseline durations of 352-400 days were paired together for feature tracking. Here 
we examine the impact that the variable baseline duration has on the velocity field 
of each period. 

The average start dates of the pairs that comprise each period are shown in 
Extended Data Fig. 1a. The start day becomes less variable once imagery from 
Landsat 7 comes online in 1999. The average baseline duration increases by about 
15 days after 1999 (Extended Data Fig. 1b), increasing the proportion of the 
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baseline that is attributable to summer motion (defined as 1 May to 31 August, in 
common with previous studies**) by about 2% (Extended Data Fig. Ic). 

We use mean summer and winter velocities from Leverett Glacier sites $1 to $4 

during the period 2009 to 2012 (refs 2, 3) to test the sensitivity of annual velocities 
to the varying baseline duration. Winter days in each period are ascribed velocities 
of 81.6 myr | and summer days are ascribed velocities of 127.6 myr_'. We then 
estimate the mean annual velocity that would be expected for each period 
(Extended Data Fig. 1d). Variations in the baseline duration between periods 
are estimated to affect extracted annual ice motion by more than 2myr_'. 
Furthermore, according to this analysis, increased baseline durations during the 
2000s leads to a small artificial increase in ice motion caused by the feature- 
tracking method. This is in the opposite direction to the interannual slowdown 
signal that we observe in our study area, leading us to conclude that our slowdown 
trend is robust to varying baseline durations. 
Impact of changing ice geometry on velocity. During the past 30 years, the GIS 
has thinned along its margins**’’**, changing the geometry of the ice mass. It is 
likely that these changes will have affected ice velocity by modifying driving stress. 
Here we evaluate the impact that thinning may have had on velocity along transect 
A (Fig. 1), in order to bound the extent to which our observed slowdown could be 
the result of geometric changes. 

We characterize the surface velocity u, as the sum of a basal sliding (u,) and a 
vertical shear deformation (ug) contribution*: 


A 
Us = Up + Ua = Ch(p,gSH)” + 4 (Pigs) He (2) 


where A is a temperature-dependent Glen’s flow parameter, p; is the density of ice, 
g is gravitational acceleration, S is the surface slope (positive where the surface 
lowers towards the margin), H is the ice thickness, and C, and m are parameters 
related to basal sliding. A, C, and m are in general poorly constrained, and are 
likely to vary spatially; however, the only assumptions we make in our analysis 
regarding these parameters are that they do not change markedly over the time 
interval of interest, and that m is less than or equal to 3. Note that our model allows 
for either the power-law rheological model of Weertman™ or the Newtonian till 
model of Alley et al.**. Thus the maximum deceleration predicted by the above 
model bounds the slowdown that can be explained by geometric changes alone. 
Below, we estimate this maximum deceleration to first order. 

We introduce the variable 7, which represents the fraction of surface velocity 
that can be explained by vertical shear, that is, ug = Aus, Up =(1—/)us. 

If we consider a small change 6S in slope (6S < S), and a small change 5H in ice 
thickness (6H < H), equation (2) leads to the following change in u,: 


Su, (ms) os (™ +t) oH +0(5H?, 852) 
Us S Us H 


(3) 


(m(1—A)+3A) ms + (m(1 aan + 0(6H?, 5S”) 


where the 6-notation is used to signify terms that are of order 6H? and 6S* or 
higher and thus are negligibly small. Again, we make no assumption regarding the 
spatial variability of A, other than that it is between 0 and 1, as our aim is to find the 


v7 : avg igs —_ wp (9S i 
conditions under which velocity is most sensitive to thinning. With ($) positive 


6H 
and (=) negative in equation (3), then at any point along the transect, and for 


any m = 3, the change to u, cannot be more negative than when the flow is due to 

vertical shear, that is, when / is equal to 1. The first-order relative change in surface 

ou, 

velocity, —, is thus bounded by: 
u 


eae hos (4) 


We use equation (4) to estimate the maximal impact of these thinning scenarios on 
ice velocity, solving every 240 m along transect A. To estimate the total ice thinning 
during 1985-2014, 6H = 10,20 are prescribed at the ice-sheet margin and linearly 
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interpolated along the transect to 6H = 0 m at 100km inland (equivalent to the 
equilibrium line altitude, ~1,500 m.a.s.L, ref. 36) (Extended Data Fig. 5a). We add 
6H to current ice thickness along the transect”* to set H to values appropriate for 
1985. We prescribe the initial slope S as the mean slope in our study area, 
0.02mm!. The change in slope, 6S, is calculated from the prescribed linear 
change in ice thickness over distance inland. 

In Extended Data Fig. 5b we plot equation (4) according to the two thinning 
scenarios described above. These profiles represent the largest (that is, the most 
negative) percentage changes in velocity associated with the prescribed geometric 
change. We then convert these profiles to the predicted reductions in velocity and 
remove them from the observed 1985-94 velocities (Extended Data Fig. 5c), 
generating a lower bound for the 2007-14 velocities under the assumption that 
the observed slowdown was geometrically induced. Between about 0 km and 5km 
from the ice margin for the 6H = 20 m scenario, the observed slowdown is within 
the range predicted by equation (4), and we cannot reject the possibility that the 
thinning here was responsible for the slowdown. However, between 10-50km 
inland, at most 17-33% of the observed slowdown can be attributed to changing 
ice-sheet geometry depending on the prescribed 6H. By 60 km inland, there is 
essentially no net change in ice velocity attributable to geometrical changes. 

Finally we must consider our assumption that the Glen’s law parameter A can be 
treated as constant in time. Phillips et al.°” showed that latent heat transferred to 
the ice from surface melt could warm glacial ice at depth, thereby leading to an 
increase in A. However, this process would result in acceleration rather than 
deceleration. Thus our decision not to consider temporal changes in A in our 
analysis of maximal geometrically induced slowdown is justified. We conclude 
that the slowdown that we have observed during 1985-2014 is not explicable by 
geometrical changes to the ice sheet alone, and instead must be dominated, 
at distances greater than ~5km from the margin, by other processes that affect 
basal motion. 

Surface melting. GIS annual melting was output from a runoff/retention model 
applied to downscaled ERA-interim data from the European Centre for Medium- 
Range Weather Forecasts (ECMWEF) on an equal-area 5 km X 5 km polar stereo- 
graphic grid for the Greenland region**. We calculated mean annual melt rates for 
the study area (67.45° N, 51.5° W to 69.2° N, 49.2° W). Interannual fluctuations 
and trends from several independent melt models show good agreement”, includ- 
ing with the methodology used in this study. 

Code availability. The feature-tracking algorithm is proprietary software 
developed and licensed by GAMMA Remote Sensing. Requests for the code 
underlying the processing strategy used here should be sent to the original 
authors”’. Similarly, requests for the code constituting the runoff/retention model 
should be addressed to its original authors”. 
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Extended Data Figure 1 | Sensitivity of extracted ice motion to variations 
in baseline duration. For each period are shown: a, the average start day-of- 
year (DOY) ofall pairs used in the period; b, the average baseline duration of all 
pairs used in the period; c, the proportion of the baseline duration that is 
attributable to summer, which is defined as 1 May to 31 August; and d, the 
annual velocity that would be expected in the ablation zone of the Leverett 
glacier catchment, based on the average proportion of summer versus winter 
and the average baseline duration for each year. 
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Extended Data Figure 2 | Statistical significance of three different periods 
of surface meltwater production. Hypothesis test of the Wilcoxon rank sum 
test at 95% confidence (see Methods), showing that three periods of surface 
melt separated by the specified dates have statistically different medians 
(outlined white region). The colouring shows the residuals of a three-trend 
linear segmented regression model fitted to melting at each possible combi- 
nation of two break dates, expressed as the root-mean-squared error. The grey 
area is shaded as such for simplicity; it would otherwise be a mirror image 

of the coloured area. 
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Extended Data Figure 3 | Statistical significance of two different periods of 
ice motion. a, Hypothesis test of the Wilcoxon rank sum test for equal 
medians, testing the probability that the two populations (separated by the 
specified date) are similar, with 95% confidence. 0 signifies that the hypothesis 
of equal medians cannot be rejected, and 1 signifies that the hypothesis of 
equal medians can be rejected. b, Residuals shown as the sum-of-squares 
(myr 1) of a two-trend model fitted to velocities at each possible break date. 
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Extended Data Figure 4 | Ice velocities during each period. The velocities have uncertainties < 60 m yr _‘ and were observed across at least 30% of the study area 
in each period (see Methods). 
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Extended Data Figure 5 | Impact of changing ice geometry on ice motion. _ equation (4). c, Left axis, observed ice velocity during 1985-94 (dotted green) 
Ice thinning of 10 m (purple) and 20 m (blue) at the ice margin, through toO0m and 2007-14 (dotted red). Modelled velocities in 2014 (solid lines) for the 

at 100 km inland, was applied to transect A. a, Prescribed change in ice prescribed ice thicknesses. Right axis, ice thickness (dashed grey)’. 

thickness over transect length. b, The ratio of velocity change, calculated from 
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Extended Data Table 1 | Statistical relationship between melting and ice motion 
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The results of linear regression analysis carried out between all periods of ice motion in Fig. 2b and different estimates of temporally coincident“ and antecedent meltwater production (see Methods). 
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The earliest unequivocally modern humans 


in southern China 


Wu Liu'*, Maria Martinon-Torres****, Yan-jun Cai®, Song Xing’, Hao-wen Tong’, Shu-wen Pei, 
Mark Jan Sier*®’, Xiao-hong Wu®, R. Lawrence Edwards’, Hai Cheng"®, Yi-yuan Li", Xiong-xin Yang”, 


José Maria Bermudez de Castro”* & Xiu-jie Wul* 


The hominin record from southern Asia for the early Late 
Pleistocene epoch is scarce. Well-dated and well-preserved fossils 
older than ~45,000 years that can be unequivocally attributed to 
Homo sapiens are lacking’~*. Here we present evidence from the 
newly excavated Fuyan Cave in Daoxian (southern China). This site 
has provided 47 human teeth dated to more than 80,000 years old, 
and with an inferred maximum age of 120,000 years. The morph- 
ological and metric assessment of this sample supports its unequi- 
vocal assignment to H. sapiens. The Daoxian sample is more 
derived than any other anatomically modern humans, resembling 
middle-to-late Late Pleistocene specimens and even contemporary 
humans. Our study shows that fully modern morphologies were 
present in southern China 30,000-70,000 years earlier than in the 
Levant and Europe*’. Our data fill a chronological and geograph- 
ical gap that is relevant for understanding when H. sapiens first 
appeared in southern Asia. The Daoxian teeth also support the 
hypothesis that during the same period, southern China was inhab- 
ited by more derived populations than central and northern China. 
This evidence is important for the study of dispersal routes of 
modern humans. Finally, our results are relevant to exploring the 
reasons for the relatively late entry of H. sapiens into Europe. Some 
studies have investigated how the competition with H. sapiens may 
have caused Neanderthals’ extinction (see ref. 8 and references 
therein). Notably, although fully modern humans were already pre- 
sent in southern China at least as early as ~80,000 years ago, there is 
no evidence that they entered Europe before ~45,000 years ago. This 
could indicate that H. neanderthalensis was indeed an additional 
ecological barrier for modern humans, who could only enter 
Europe when the demise of Neanderthals had already started. 

The Fuyan Cave (25° 39’ 02.7” N, 111° 28’ 49.2” E; 232 m above sea 
level) is located in Tangbei Village, Daoxian County, Hunan Province, 
southern China (Fig. 1). It is part of a large multi-genesis pipeline-type 
karst system that contains several connected and stacked caves 
(Supplementary Information A), and covers an area of more than 
3,000 m?. The investigation and excavations were conducted at three 
regions in the cave, regions I, II and III (Extended Data Fig. 1). From 
2011 to 2013, systematic excavations yielded 47 human teeth and an 
abundant fossil mammalian assemblage (Fig. 2 and Extended Data 
Figs 2 and 3). 

Four clear stratigraphic layers were consistently identified in the 
whole excavated regions (regions I, II and III), with a total thickness 
of more than 250 cm (Fig. 1). All the hominin and mammalian fossils 
were found in layer 2 of region I (mammals) and region II (mammals 
and humans), although three human teeth (DX1, DX2 and DX6) anda 


small amount of mammalian fossils were found out of context as 
surface findings during the first year of excavation. The stratigraphic 
sequence of region II, from top to bottom, is described as follows: (1) 
layer 1: continuous brown-grey and brown-yellow flowstone/calcite- 
cemented deposit with a maximum thickness of 20 cm; (2) layer 2: 
brown-yellow and grey fine sandy clay of 20-50 cm in thickness that 
contains a large amount of mammalian fossils and the hominin teeth; 
(3) layer 3: brown and grey sandy gravel of 80-100 cm in thickness; and 
(4) layer 4: grey-yellow and brown-yellow silt and clay with calcareous 
breccia imbedded. This layer is more than 100 cm in thickness as the 
bottom has not been reached yet. 

At present, no stone tools have been found. The hominin and most 
of the faunal elements consist exclusively of teeth, and many of them 
present root alterations mostly due to the effects of calcium dissolution 
and some rodent gnawing (Supplementary Information B). The mam- 
malian fossil assemblage from the Daoxian site is typical of Late 
Pleistocene in southern China, and is composed of 38 species including 
5 extinct large mammals such as Ailuropoda baconi, Crocuta ultima, 
Stegodon orientalis, Megatapirus augustus and Sus sp. (Extended Data 
Table 1 and Supplementary Information C). The radiocarbon age 
older than 43,000 calibrated years Bp (43 kyr cal Bp) obtained for one 
of the faunal remains (see Supplementary Information D) supports its 
pre-late Late Pleistocene age. 

During the excavations, we collected nine samples of speleothem 
fragments from layers 2 to 3 (FYS-1 to FYS-9) at regions I and II, and 
two subsamples (FYS-S1 to FYS-S2) from a small stalagmite that grew 
on the top of layer 1 (Fig. 1, Extended Data Fig. 1 and Supplementary 
Information E). These samples were carefully preprocessed to single 
out the clean portion for **°Th dating, and then analysed at the Isotope 
Lab of University of Minnesota using the multicollector-inductively 
coupled plasma-mass spectrometry (MC-ICP-MS) dating technique’. 
Eight speleothem fragments from layer 2 yielded Middle to Late 
Pleistocene ages ranging from ~556kyr BP to 120.7 kyr Bp, and one 
sample collected from layer 3 (FYS-9) provided an age older than 
600 kyr BP and thus, beyond the limit of the *°Th dating method 
(Table 1). The two subsamples from the small stalagmite give an age 
of 80.1 + 1.2 kyr Bp and 79.5 + 2.8 kyr Bp (mean + 2s.d.), respectively. 

The calcitic floor (layer 1) is encrusted on layer 2, and is continuous 
across the excavated regions, preventing younger material from 
being introduced into the underlying deposits (see Supplementary 
Information, Cave Tour). The abundant and extensive distribution 
of the fauna and human teeth across the cave makes re-deposition of 
layer 2 highly unlikely. In addition, palaeomagnetic and rock- 
magnetic analysis of a sample layer 1 at region IIA confirms that 
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Figure 1 | Geographical location and stratigraphy of the Daoxian site. 

a, Location of the Daoxian site. Late Middle Pleistocene and Late Pleistocene 
localities with human remains that have been included in the morphological 
and/or metric comparison with Daoxian are also marked on the map. 2: 
Tianyuan Cave; 3: Huanglong Cave; 4: Liujiang; 5: Zhiren Cave; 6: Tubo; 7: 
Xujiayao; 8: Luna; 9: Chuandong; 10: Malu Cave; 11: Lijiang; 12: Longlin; 13: 
Huli Cave; and 14: Xintai. The map is adapted from the original Chinese map 


the flowstone that caps the fossil-bearing layer 2 remains in situ 
(Supplementary Information F and Extended Data Fig. 4). There- 
fore, the dated stalagmite was formed after the fossils were buried 
and it provides a minimum age constraint (~80 kyr) for the fossils 
below. Because the associated fauna is typical of the Late Pleistocene, 
we conservatively assume that fossils are not older than ~120 kyr, and 


DX37-0 
DX37-I 
DX28-o 
DX5-b 


—===] Limestone 
M9 Excavation area 


from National Administration of Surveying, Mapping and Geoinformation 
of China (http://219.238.166.215/mcp/index.asp). b, General view of the 
interior of the cave and the spatial relationship of regions ITA, IIB and IIC, 
with some of the layers marked. c, Plan view of the excavation area. d, Detail 
of the stratigraphic layers of region II of the Daoxian site. All human fossils 
come from layer 2. 


the presence of hominins at Daoxian can be bracketed between 80 kyr 
and 120 kyr. 

Daoxian teeth were compared to large dental samples of Late 
Pleistocene hominin fossils from Europe, Africa and Asia (Extended 
Data Tables 2 and 3 and Supplementary Information G and H). The 
Daoxian teeth are small and they consistently fall within H. sapiens 


DX11-o DX18-o 


Figure 2 | Daoxian human teeth (selection). See Extended Data Table 2 for detailed information about each tooth. b, buccal; d, distal; 1, lingual, m, mesial; 0, 


occlusal. Credits: $.X. and X.-J.W. 
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Table 1 | The 2?°Th ages of the Daoxian site 


Sample ID Region/layer 238 (ppb) 232Th (ppt) 230TH /232Th 

(atomic x 107°) 
FYS-S1 D/layer 1 33.3203 S117 +183 76.5+3.6 
FYS-S2 D/layer 1 285.9+04 55,026 + 1,102 64.0+ 1.3 
FYS-1 A/layer 2 428.2 +0.7 98,699 + 1,976 59.44 1.2 
FYS-2 A/layer 2 10,747.9+ 69.1 27,564 + 552 7463.3 + 149.9 
FYS-3 C/layer 2 26.0 + 0.2 8,675 +174 263.5 +53 
FYS-4 B/layer 2 1,608.5 + 5.5 889+ 18 29,237 + 588 
FYS-5 B/layer 2 260.0 + 0.4 41,356 + 828 22) #25 
FYS-6 A/layer 2 20.6 + 0.2 81,358 + 1,629 22.6+0.5 
FYS-7 D/layer 2 874+0.1 12,302 + 246 20.3225 
FYS-8 D/layer 2 78.4 +0.2 20,571 +413 55.2+1.1 
FYS-9 C/layer 3 267.6+04 20,907 + 419 10,618.9 + 226.1 
*8254Y = ([734U/?38U] activity — 1) X 1,000. 


48° 34Uinitiat WaS Calculated based on 2°°Th age (7), that is, 7°*Uinitial = 67°4U measured X EXP(A234 X 7). 


tCorrected 7°°Th ages assume the initial 7°°Th/?57Th atomic ratio of 4.4 + 2.2 x 10~®©. Those are the valu 


arbitrarily assumed to be 50%. 
BP stands for ‘before present’, in which ‘present’ is defined as the year 1950 ap. Values are mean + 2s.d. 


variability (Fig. 3 and Extended Data Fig. 5). They are generally smaller 
than other Late Pleistocene specimens from Africa and Asia, and closer 
to European Late Pleistocene samples and contemporary modern 
humans. Both the crown and the root of Daoxian teeth show typical 
morphologies for H. sapiens (Fig. 2 and Extended Data Fig. 6), with 
simplified occlusal and labial/buccal surfaces and short and slender 
roots. The presence of moderate basal bulging as well as longitudinal 
grooves in the buccal surface of canines, premolars and molars from 
other Late Pleistocene samples such as Xujiayao, Huanglong Cave, 
Qafzeh or Dolni Vestonice make Daoxian teeth morphologically closer 
to middle-to-late Late Pleistocene and even contemporary human 
samples (Extended Data Fig. 6). Canine and molar roots are gracile 
and barely divergent, differing from the stout and robust root systems 
of Tubo or Xujiayao localities’? where radicals do not narrow towards 
the tip (Extended Data Fig. 6). Indeed, the convergent apices of the 
molar buccal roots appear as a typical feature in contemporary 
H. sapiens. M' molars are also typical of H. sapiens and unlike the 
rhomboidal contour displayed by H. neanderthalensis"' or the bucco- 
lingually elongated shape of Asian H. erectus'**. The relative cusp and 
occlusal polygon areas of the Daoxian M’ molars follow the H. sapiens 


a4u* 230TH/238U 230TH age (kyr BP) 8°4Uinitiart ~°S°Th age (kyr BP)E 

(measured) (activity) (uncorrected) (corrected) (corrected) 
353.1445 0.7326 + 0.0033 81.52+0.7 443 +6 80.1 + 1.2 
356.2+3.2 0.7467 + 0.0026 83.4+0.5 446+5 79.5+28 

54.342.1 0.8302+0.0017 164.7+1.2 85+3 158.3 +4.6 
633.0+4.2 1.1609+0.0077 121.0+1.5 89147 121.0215 

75.6£2.5 1.1000+0.0027 5583+628 364+67 556.8+619 
401.0+3.5 0.9800+0.0035 120.7+0.9 564+5 120.7 +0.9 
173.3227 1.1837+0.0026 351.52+8.1 463 +13 348.3 + 8.2 
171.4+2.7 0.9236+0.0027 1585214 256+10 141.8+12.1 
187.8+5.1 1.0276+0.0055 196.1+3.8 324+ 10 192.9+43 
1574+7.7 0.8786+0.0044 147.1+2.7 234+ 12 140.7 + 5.2 
147.6+3.2 2.5165+0.3707 >600 - - 


es for a material at secular equilibrium, with the bulk earth 23*Th/22°U value of 3.8. The errors are 


pattern and they only differ by 0.6% to 1.1% from modern Chinese 
populations (Extended Data Table 4). Interestingly, Qafzeh M's are 
comparatively less derived than Daoxian, showing a departure from 
the typical H. sapiens pattern of cusp proportions and angles that were 
previously noticed’. The occlusal morphology of Daoxian M* and M’s 
is also simple, and both the metacone and the hypocone are strongly 
reduced as it is typical of H. sapiens. The lack of labial convexity, shovel 
shape and tuberculum dentale, as well as the gracile root of the 
Daoxian incisor I, resemble that of contemporary and Late Pleistocene 
H. sapiens and differs from Neanderthals. However, Dolni Vestonice 
specimens display higher labial convexity, and Qafzeh and Huanglong 
Cave I,s present a more prominent basal eminence, making Daoxian I, 
closer to contemporary humans rather than other Late Pleistocene 
samples. The two Daoxian P3 premolars show a slightly asymmetric 
oval contour due to the disto-lingual projection of a small platform- 
like talonid without accessory cusps. Overall, the crown morphology 
together with the expression of a slender single root of Daoxian P3s is 
closer to H. sapiens and differs from the typical Neanderthal con- 
formation, with compressed and centred occlusal polygon and lin- 
gually displaced metaconid’*. Lower molars lack the typically 
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Figure 3 | Metric comparison of Daoxian teeth. Bivariate plots of the mesiodistal (MD) and buccolingual (BL) dimensions of the Daoxian upper fourth premolar 


(P*), lower canine (C,), lower third premolar (P3) and lower second molar (Mb). 
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Neanderthal combination ofa pit-like anterior fovea with a continuous 
mid-trigonid crest'*!”. This, together with the reduction of the hypo- 
conulid and the expression of an X-pattern in all the M, and M; where 
this feature could be recorded make Daoxian lower molars morpho- 
logically closer to anatomically and contemporary modern humans". 
In addition, no signs of taurodontism are present. Finally, the occlusal 
morphology of the two upper second deciduous molars (dm’s) from 
Daoxian is simple, and the occlusal outline relatively squared and 
unlike the typical skewed contour of Neanderthals. Roots are thin 
and diverge as is typical in deciduous teeth, and similar to the patterns 
usually found in fossil and contemporary H. sapiens. Thus, the mor- 
phological and metric comparison of the Daoxian dental sample 
allows its unequivocal attribution to H. sapiens, and they present 
particular resemblances to late Late Pleistocene samples and contem- 
porary modern humans. 

At present, the earliest unambiguous evidence of H. sapiens fossils 
eastward of the Arabian Peninsula comes from Tianyuan Cave, in 
northern China’, Niah Cave in Borneo* and Lake Mungo in 
Australia” dated to ~40,000-50,000 years. The retention of primitive 
features in Qafzeh and Skhul has been interpreted by many as evidence 
of a ‘failed’ dispersal’’*’, and several studies have recently suggested 
that an earlier and southern route may have been indeed more favour- 
able for hominin expansion””**°. However, these and other related 
hypotheses were lacking the support of clear evidence of modern 
human occupation outside Africa (excluding the Levant) during the 
early Late Pleistocene. The fragmentary nature and/or the mosaic of 
modern and archaic features of remains such as those from the Zhiren 
Cave have prevented a unanimous acceptance of its taxonomic status’”®. 
This, together with the contested chronological-stratigraphic frame of 
some of the Asian hominin findings (see ref. 2 for a review), make the 
Daoxian teeth the earliest and soundest evidence of definitely modern 
humans in southern China at least 80 kyr ago. The Daoxian evidence 
may finally change the scepticism that most hypotheses considering the 
presence of H. sapiens in the early Late Pleistocene in China have been 
subjected to. 

While the Daoxian findings would support the presence of fully 
modern populations in southern China during the early Late 
Pleistocene, the Xujiayao’® and Denisova evidence” points to consid- 
erably more primitive hominins in the northern latitudes. Similarly, 
the dental morphology of the late Middle Pleistocene hominin from 
Panxian Dadong in southern China already exhibits some derived 
features” that are absent in other roughly contemporaneous Asian 
populations of higher latitudes, such as those from Zhoukoudian, 
Hexian or Chaoxian'’. This evidence could support different origins 
and/or dispersal routes for modern humans across Asia**>”*. 

Finally, while fully modern humans succeeded to disperse through- 
out Asia during the early Late Pleistocene, they failed to do so in 
Europe until 35,000-75,000 years later. Thus, we should not rule out 
the possibility that H. neanderthalensis was for a long time an addi- 
tional barrier for modern humans’ expansion, who could only settle in 
Europe when Neanderthal populations started to fade. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Description of the morphological features of the Daoxian hominin teeth follow the 
terminology usually used in dental studies'’”’*°. To assess the morphological 
affinities of Daoxian teeth, we compared them to other Late Pleistocene samples 
from Africa, Asia and Europe (including Neanderthals), as well as a large con- 
temporary H. sapiens sample (see Extended Data Table 3). Apart from both the 
descriptive comparative anatomy and the mesiodistal and buccolingual compar- 
ison, in the case of the M! we also calculated the relative cusp and occlusal 
polygon size. 

For the metric comparison, the crown mesiodistal and buccolingual dimensions 
of the Daoxian teeth were measured with a standard sliding caliper and recorded to 
the nearest 0.1 mm. Bivariate plots of the mesiodistal and buccolingual diameters 
will be provided for the metric comparison of Daoxian with other hominin sam- 
ples. To explore the Daoxian hominins in the context of the Middle to Late 
Pleistocene evolutionary changes in China, some Middle Pleistocene hominins 
from China were also included. 

In addition, the total crown area and relative cusp area, and relative polygon 
areas for upper first molars were also measured and compared to a modern 
Chinese population. These features are considered to be taxonomically discrim- 
inative, particularly between H. sapiens and H. neanderthalensis''. The protocols 
for the measurement and calculation of the relative cusp areas of M’ can be found 
in ref. 11. 
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Extended Data Figure 1 | The Daoxian site. a, Entrance to the Fuyan enlarged area shows the individual location of each human tooth. Lower 
(Daoxian) Cave. b, Image of the intact flowstone in an unexcavated area. pictures provide a detail of the location of each dating sample. FYS, speleothem 
c, Detail of the excavation at region IIC. Pink flags point to in situ human fragment samples; FYS-S, stalagmite samples. For more details on the U-series 


findings. d-g, Detail of the stratigraphy of region IIA (d), IIB (e), IC (f) andIID __ results, see Table 1 and Supplementary Information E. 
(g). In the centre, plan view of the excavation area at the Daoxian Cave. The 
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Extended Data Figure 2 | Daoxian upper teeth. Please see Extended Data Table 2 for detailed information. b, buccal; d, distal; 1, lingual; m, mesial; 0, occlusal. 
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Extended Data Figure 3 | Daoxian lower teeth. Please see Extended Data Table 2 for detailed information. 
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ID# Demagtype DEC INC MAD Q AF/Tinf AF/Tsup NA/A VGPA VGP® 
DIA AF 328 41 7. 1 15 mT 100 mT NA 57.6 356.2E 
DID Hybrid 311 37 10.7 2 15 mT 100 mT A 43.8 9.1E 
DIE TH 335 57 8.1 2 400°C 590°C A 70.4 16E 
D2C Hybrid 91 33 1,2 2 15 mT 70 mT A 11.2 186.8 E 
D2D AF 101 31 3:9 1 15 mT 100 mT NA 3.3 181.3E 
(f) D2E TH 94 37 41 2 250°C 590°C A 10.8 182.9E 
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Extended Data Figure 4 | Palaeomagnetic and rock-magnetic analysis of 
Daoxian flowstone. a, Location of the orientated handsamples. White arrow 
indicates sample D1, black arrow indicates sample D2. b, Zijderveld diagram of 
alternating field demagnetized specimen D1A. Numbers next to the graph 
denote alternating field step in mT. c, Isothermal remanent magnetization 
(IRM) acquisition curve up to 1T for specimens D1A and D2D. d, Progressive 
stepwise thermal demagnetization of an IRM up to 1T of specimen D1A. 

e, Projection of virtual geomagnetic pole (VGP) of sample D1 with associated 
«95. f, Summary table of the thermal (TH), alternating field (AF), and hybrid 


(both AF and TH) palaeomagnetic results. ID# denotes sample identification. 
A, anchored; DEC, declination of characteristic remanent magnetization 
(ChRM) direction; demag, demagnetization; INC, inclination of ChRM 
direction; MAD, maximum angular deviation; NA, not anchored; NRM, 
natural remanent magnetization; Q, quality index of ChRM direction, with 1 
the highest quality and 2 the lowest; VGP, virtual geomagnetic pole latitude. 
AF/Tinf, lowest AF level or temperature step of ChRM in mT or °C; AF/Tsup, 
highest AF level or temperature step of ChRM in mT or °C. 
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Extended Data Figure 5 | Metric comparison of Daoxian teeth. Bivariate plots of the mesiodistal (MD) and buccolingual (BL) diameters of Cc}, P?, M!, M?, M?, 
I,, M; and M; of Daoxian and comparative samples. 
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Extended Data Figure 6 | Morphological comparison of Daoxian teeth. 
Comparative morphology of the Daoxian human teeth with other Pleistocene 
hominins and modern humans. Top left, upper canines. I, V: Daoxian 
(DX37); II, VI: Xujiayao (PA1480); II], VII: Huanglong Cave; IV, VIII: modern 
human. Bottom left: upper third premolars. I, II, III: Daoxian (DX13, DX 29, 
DX42); IV: modern human; V: Chaoxian; VI: Changyang (PA76); VII: 
Panxian Dadong (PA1577); VIII: Xujiayao (PA1480). Top right, upper first 


molars. I: Daoxian (DX28); II: Neanderthal (Petit-Puymoyen Mx6); III: Qafzeh 
5; IV: Tubo (PA1471); V: Hexian (PA836); VI: Chaoxian; VII: Xujiayao 
(PA1480); VIII: modern human. Middle right, lower second molars. I: Daoxian 
(DX30); I: Neanderthal (Hortus IV); II: Dolni Vestonice (DV37); IV: 
Huanglong Cave; V: Xintai; VI: modern human. Bottom right, upper third 
molars. I, lV: Daoxian (DX17), II, VIL: Xujiayao; III, VIII: Huanglong Cave; IV, 
IX: Tubo (PA1476); V, X: modern humans. 
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Extended Data Table 1 | List of faunal composition at Daoxian and other Late Pleistocene localities of southern China 


Fuyan Cave (Daoxian) 


Anourosorex squamipes 
Soriculus sp. 
Rhinolophus ferrumequinum 
Hipposideros armiger 
Eptesicus serotinus 
Murina leucogaster 
Tadarida insignis 
Trachypithecus sp. 
Macaca sp. 

Hylobates sp. 
Pteromyidae indet. 
Rhizomys sp. 

Rattus norvegicus 
Leopoldamys edwardsi 
HAystrix subcristata 
Cuon javanicus 

Ursus thibetanus 
Ailuropoda baconi* 
Martes flavigula 
Arctonyx collaris 

Lutra lutra 

Viverricula sp. 

Viverra sp. 

Crocuta ultima 
Panthera pardus 
Panthera tigris 
Prionailurus bengalensis 
Stegodon orientalis 
Elephas maximus 
Megatapirus augustus 
Dicerorhinus sumatrensis 


Huanglong Cave~ 


+ 
S. leucops 


+ 
+ 


V. zibetha 


D. kirchbergensis 


Zhiren Cave 


+ 
R. pearsoni 
H. pratti 


Petaurista 
+ 


+ 
+ 


+ 
+ 
Felis sp. 


+ 
+ 


Rh. sinensis 


Liujiang’ 


Pongo sp. 


R. rattus 


sp. 
sp. 
sp. 
A. melanoleucus 


sp. 
+ 


+ 
Rh. sinensis 


Sus sp. S. xiaozhu S. cf. xiaozhu 
Sus scrofa - i sp. 
Moschus sp. M. moschiferus 
Muntiacus muntjak ce sp. sp. 
Cervus nippon sp. 
Cervus unicolor - + 
Capricornis sumatraensis + 
Bos (Bibos) gaurus Bubalus Bovidae indet. 
Number of extinct large Number of extinct | Number of Number of extinct 
mammal species: 5 large mammal extinct large large mammal 
species: 9 mammal species: 3 
species: 5 


Percentage of extinct species of 
large mammals: 19% 


Extinct species are marked in bold. References 31-35 are cited in the table. 


Percentage of 
extinct species of 
large mammals: 
25% 


Percentage of 
extinct species 
of large 
mammals: 25% 
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Extended Data Table 2 


List and measurements of Daoxian teeth 


List of the Daoxian dental remains by tooth class with the degree of occlusal wear (following ref. 36) crown measurements, and region and stratigraphic position. 


millimetres. 


Specimen Number Tooth Occlusal Location and 
' Side Wear MD BL Date of discovery stratigraphic 

Field Museum class : 
degree provenience 

DXx1 PA1543 M! i 3 10.5 11.8 October 8, 2011 i 
Out of context 

1 . TIA 

DX2 PA1544 Cc L 5 October 8, 2011 Giner conics 

Dx3 PA1545 P, R 4 October 10, 2011 IIA Layer 2 
DxX4 PA1546 M! L 4 October 10, 2011 IIA Layer 2 
DX5 PA1547 M! R 4 October 10, 2011 IIA Layer 2 

DX6 PA1548 M! R 4 October 11, 2011 a 
Out of context 
DX7 PA1549 C R October 13, 2011 IIA Layer 2 
DxX8 PA1550 M L 5 (10.0) 11.5 September 2, 2012 IIB Layer 2 
DX9 PAI551 Mp L 6 11.0 10.5 September 3, 2012 IIB Layer 2 
DX10 PA1552 M; L, 3. 10.5 12.4 September 3, 2012 IIB Layer 2 
DX11 PA1553 L L | 4 | 60 | September 3, 2012 IIB Layer 2 
DX12 PA1554 M L 3 September 11, 2012 IIB Layer 2 
DX13 PA1555 P L 4 September 11, 2012 IIB Layer 2 
Dx14 PA1556 M- R 3 September 12, 2012 IIB Layer 2 
DX15 PA1557 M, L 4 September 19, 2012 IIB Layer 2 
DX16 PA1558 M L, September 21, 2012 IIB Layer 2 
DX17 PA1559 M L 6 September 22, 2012 IIB Layer 2 
DX18 PA1560 C, R 2 September 22, 2012 IIB Layer 2 
DX19 PA1561 M> L 6 September 23, 2012 IIC Layer 2 
DX20 PA1562 M! L 2 September 23, 2012 IIC Layer 2 
DX21 PA1563 M R 0 September 23, 2012 IIC Layer 2 
DX22 PA1564 M3 L 3 September 23, 2012 IIC Layer 2 
DX23 PA1565 C L 2 September 23, 2012 IIC Layer 2 
DXx24 PA1566 M R September 24, 2012 IIC Layer 2 
DX25 PA1567 M R | 6 | September 24, 2012 IIC Layer 2 
DX26 PA1568 P R 6 September 24, 2012 IIC Layer 2 
DX27 PA1569 M3 R 3 September 24, 2012 IIC Layer 2 
DX28 PA1570 M L 1 September 24, 2012 IIC Layer 2 
DXx29 PAI571 P? L : September 24, 2012 IIC Layer 2 
DX30 PA1581 M; R November 22, 2013 IIC Layer 2 
DX31 PA1582 M L 6 (9.8) 11.5 November 22, 2013 IIC Layer 2 
DX32 PA1583 dm R 5 9.8 10.4 | November 22, 2013 IIC Layer 2 
DX33 PA1584 dm 1 4 8.6 10.2 | November 22, 2013 IIC Layer 2 
DX34 PA1585 P L November 22, 2013 IIC Layer 2 
DX35 PA1586 M R November 22, 2013 IIC Layer 2 
DX36 PA1587 M Ie, 2 10.0 12.0 | November 23, 2013 TIC Layer 2 
DX37 PA1588 Cc! R 2 8.0 November 23, 2013 IIC Layer 2 
DX38 PA1589 pt R 6 6.2 November 24, 2013 IIC Layer 2 
DxX39 PA1590 M R 2 November 24, 2013 IIC Layer 2 
DxX40 PA1591 pt R 6 November 25, 2013 IIC Layer 2 
DxX41 PA1592 M- L 6 November 26, 2013 IIC Layer 2 
Dx42 PA1593 Pp L 1 : November 26, 2013 IIC Layer 2 
DXx43 PA1594 P* R | 6 |  - [| 9.0 [ November 26, 2013 IIC Layer 2 
Dx44 PA1595 M, L 5 November 26, 2013 IIC Layer 2 
Dx45 PA1596 M, R 5 November 26, 2013 IIC Layer 2 
DX46 PA1597 M> R 7 November 26, 2013 IIC Layer 2 
DX47 PA1598 M! R 5 November 27, 2013 IIC Layer 2 
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Geography/ Chronology | Specimens Sources of metrics 
Africa 
Late Pleistocene Herto, Klasies River Mouth*, Mumba ? 
Holocene Mesolithic North African sample* (A falou, 
Tebessa, Ain Meterchem, Gambetta, Ain 
Dokkara, Taforalt 
East Asia 
Mid-Middle Pleistocene | Chenjiawo*, Hexian*, Yiyuan*, ae 
Zhoukoudian ZKD 
Late-Middle Changyang, Chaoxian*, Dingcun*, all 
Pleistocene Jinniushan, Panxian Dadong*, Tongzi*, 
Xujiayao*, Zhoukoudian Locality 4 
Late Pleistocene Bailian Cave, Baojiyan, Changwu, . 
Chuandong, Duan, Huanglong Cave*, Huli 
Cave, Jimuyan, Lipu, Liujiang*, Longlin 
Longdong, Longtanshan, Luna Cave, 
Nanshan Cave, Tiandong, Tianyuan Cave*, 
Tubo*, Xichou, Xintai*, Zhaotong, 
Zhiren Cave*, Upper Cave 
Holocene and Henan Province, Hubei Province --- 
contemporary modern 
humans 
Central Asia 
Late Pleistocene Denisova . 
West Asia 
Late Pleistocene Qafzeh*, Skhul Contributed by 
Wolpoff 
Holocene and Eynan*, Hayonim*, Nahal Oren*, Ohalo* -- 
contemporary modern 
humans 
Neanderthals Amud*, Tabun*, Kebara*, Shanidar -- 
Europe 
Neanderthals Arcy Grotte Renne*, Arcy Hyene*, Arcy Sur Contributed by 
Cure (Mousterian), Chateauneuf, Ehringsdorf, | Wolpoff 
Genay (C6te d'Or), Gibraltar, Hortus, Krapina, 
Kulna, La Chaise, La Ferrassie, La Quina, 
Monsempron*, Le Moustier, Ochoz, Pech de 
l'aze, Petit Puymoyen, Regourdou, 
Saccopastore, Sakajia, Spy, St. Césaire, 
Subalyuk, Vindija 
Late Pleistocene Abri Pataud*, Brno, Combe Capelle, Dolni Contributed by 
Véstonice*, Cro-Magnon, Fontechevade, Wolpoff and a 


Isturitz*, Le Rois*, Les Vachons, Mladeé, 
Pavlov, Predmosti, Saint Germain-La Riviere’, 
Ziaty Kun 


Holocene and 
contemporary modern 
humans 


Hispanic-muslim medieval collection of San 
Nicolas (Murcia, Spain)*, Mesolithic French 
sample* (Téviec and Hoédic), Neolithic 
French sample* (Avize, Dolmens de Bretons, 
Caverne de L’Homme Mort, Orrouy) 


Detailed list of the samples included in the morphological and metric comparison. Asterisk indicates that we examined the original fossil. For the rest, we used high resolution casts. References 37-61 are cited in 


the table. 
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Extended Data Table 4 | Upper first molar relative cusp and occlusal polygon areas 


Samples Protocone Paracone Metacone Hypocone Polygon 
N X+SD N X+SD n X+SD n X+SD n X+SD 
Daoxian 6 32.5410 6 25.9414 6 22.341.3 6 20341.3 4 34.041.8 
Modern humans 50 30.9+1.1 50 27.041.4 50 21.841.5 50 20.341.6 24 37.545.4 
Neanderthal 21 29.9424 21 25.8+2.1 21 20.6+1.8 21 23.742.1 17 26.7+1.8 
Qafzeh 7 31,342.30 7 24.841.6 7 21.3425 7 22.845 4  33.342.7 
LP HSAP 15) 31.8+1.5 15 25.742.3 15 22.441.7 15 20.143 5 32.7+1.9 


Data for Qafzeh and Late Pleistocene H. sapiens are taken from refs 62 and 63. Late Pleistocene H. sapiens (LP HSAP) sample is composed by Dolni Vestonice, Fontechevade, Laugerie Basse, Les Rois, Madeleine, 
Mladec, Patud, St Germaine-la-Riviére and Vachons. 
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Neuroblastoma is a malignant paediatric tumour of the sym- 
pathetic nervous system’. Roughly half of these tumours regress 
spontaneously or are cured by limited therapy. By contrast, 
high-risk neuroblastomas have an unfavourable clinical course 
despite intensive multimodal treatment, and their molecular 
basis has remained largely elusive’*. Here we have performed 
whole-genome sequencing of 56 neuroblastomas (high-risk, 
n= 39; low-risk, n = 17) and discovered recurrent genomic rear- 
rangements affecting a chromosomal region at 5p15.33 prox- 
imal of the telomerase reverse transcriptase gene (TERT). 
These rearrangements occurred only in high-risk neuroblasto- 
mas (12/39, 31%) in a mutually exclusive fashion with MYCN 
amplifications and ATRX mutations, which are known genetic 
events in this tumour type’*°’. In an extended case series 
(n= 217), TERT rearrangements defined a subgroup of high- 
risk tumours with particularly poor outcome. Despite a large 
structural diversity of these rearrangements, they all induced 
massive transcriptional upregulation of TERT. In the remaining 
high-risk tumours, TERT expression was also elevated in MYCN- 
amplified tumours, whereas alternative lengthening of telomeres 
was present in neuroblastomas without TERT or MYCNalterations, 
suggesting that telomere lengthening represents a central mech- 
anism defining this subtype. The 5p15.33 rearrangements juxtapose 
the TERT coding sequence to strong enhancer elements, resulting in 
massive chromatin remodelling and DNA methylation of the affec- 
ted region. Supporting a functional role of TERT, neuroblastoma 
cell lines bearing rearrangements or amplified MYCN exhibited 
both upregulated TERT expression and enzymatic telomerase activ- 
ity. In summary, our findings show that remodelling of the genomic 
context abrogates transcriptional silencing of TERT in high-risk 


neuroblastoma and places telomerase activation in the centre of 
transformation in a large fraction of these tumours. 

Several sequencing studies have been performed to uncover geno- 
mic alterations underlying the diverse clinical phenotypes of neuro- 
blastoma**. Neuroblastoma has a low mutation frequency and a 
heterogeneous mutation spectrum with few recurrently mutated 
genes. Besides MYCN, which is amplified in 20% of neuroblastomas’, 
the most frequently mutated gene is the tyrosine kinase gene ALK, 
altered in 8-10% of the cases®. Furthermore, inactivating ATRX muta- 
tions were found in a small subgroup of neuroblastomas with unfa- 
vourable outcome”. Beyond these alterations, no mechanisms have 
so far been identified that define high-risk neuroblastoma at the 
molecular level. 

We hypothesized that additional structural alterations might occur 
in high-risk neuroblastoma and performed whole-genome sequencing 
of 56 tumours and matched normal controls (Supplementary Table 1). 
By searching for breakpoint clusters occurring within 100 kilobase- 
pair (kb) regions’ we identified four locations exhibiting clustered 
breakpoints in more than three samples (Fig. 1a). Three of these 
regions were related to known genetic alterations in neuroblastoma, 
namely, MYCN amplifications, ATRX deletions, and copy number 
gains of chromosome 17q. The fourth region was located at chro- 
mosome 5p15.33 and affected 12 out of 56 tumours (21%, Fig. la 
and Extended Data Fig. 1). Chromosomal regions translocated to 
the 5p15.33 breakpoints were scattered across chromosome 5 (seven 
cases) and other chromosomes (five cases). The types of structural 
alterations were diverse and included balanced rearrangements, trans- 
locations associated with single copy number gain, and focal, high- 
level amplifications (Fig. 1b). Furthermore, the rearrangement at 
5p15.33 was caused by chromothripsis affecting chromosome 5 in 
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Figure 1 | Genomic rearrangements are clustered at chromosome 5p15.33 
in high-risk neuroblastoma. a, Distribution of genomic rearrangements 
occurring within regions of 100 kb in 56 primary neuroblastomas. 
Rearrangements clustering in more than three tumours are highlighted in red. 
b, Detail of genomic translocations occurring at chromosome 5p15.33 (n = 12) 
and their corresponding rearrangement partner (right). Levels of genomic 
copy numbers are colour-coded. c, Prevalence of MYCN amplification, TERT 
rearrangements, genomic alterations of ATRX and ALK, and chromothripsis 
in 56 primary neuroblastomas. Samples are ordered from left to the right. 
The number of somatic mutations per tumour and the clinical risk group 
assessment are given at the top. Tumour ploidy, telomere length ratio 

(both estimated from sequencing data), age of patient at diagnosis, and tumour 
stage are displayed at the bottom. S, stage 4S tumours. 


two tumours (Fig. 1b and Extended Data Fig. 2a). We noticed that the 
rearrangements in the 5p15.33 region consistently clustered in a region 
50 kb upstream of the TERT transcriptional start site without directly 
affecting the gene or its core promoter region (Fig. 1b)*’. We did not 
observe mutations affecting TERT itself or its promoter'””. 

In accordance with previous studies**, we found an overall low rate 
of non-synonymous mutations with 13.3 mutations per genome on 
average (Fig. 1c and Supplementary Table 2), and a significantly lower 
mutation rate in low-risk tumours than in the high-risk group (low- 
risk, 5.9 + 5.5 mutations; high-risk, 16.6 + 9.9 mutations; P< 0.001). 
Four cases exhibited chromothripsis affecting chromosomes 5, 17, and 
20 (Extended Data Fig. 2). We detected only three genes that were 
expressed and altered in more than two samples (MYCN, n = 10; ALK, 
n=7; and ATRX, n=7). Notably, TERT rearrangements occurred 
exclusively in high-risk tumours in 12 out of 39 of the cases (31%, 
P=0.01), similar to ATRX mutations (Fig. 1c). By contrast, ALK 
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Figure 2 | Genomic TERT rearrangements are associated with poor patient 
outcome and high TERT mRNA expression. a, Prevalence of TERT 
rearrangements in 217 primary neuroblastomas. TERT rearrangements were 
identified by whole-genome or targeted sequencing and break-apart FISH, 
exemplarily shown in the lower panel (red, TERT; green, CLPTMI1L). b, Overall 
survival of neuroblastoma patient groups defined by TERT rearrangements 
(TERT), MYCN amplification (MNA), high-risk disease without these 
alterations (HR), and low-risk or intermediate-risk disease (non-HR). Patients 
with tumours bearing both a TERT rearrangement and MYCN amplification 
(n = 5) were excluded. Overall survival at 5 years: 0.41 + 0.16 (TERT) versus 
0.54 + 0.10 (MNA) versus 0.79 + 0.08 (HR) versus 0.98 + 0.02 (LR). 

c, Multivariable Cox regression analysis of the potential prognostic factors stage, 
MYCN status, and TERT status for overall survival in patients aged >18 months 
(n = 125). d, Distribution of TERT mRNA levels derived from transcriptome 
sequencing of the groups defined above: tumours with TERT rearrangements 
(yellow, n = 10), MYCN amplifications (red, n = 9), high-risk tumours 
without the aforementioned aberrations (grey, n = 18), among which ATRX- 
mutated cases are highlighted by blue circles (n = 7), and low-risk tumours 
(green, n = 17). Error bars, median expression and s.d. e, Comparison of TERT 
expression in those tumour subgroups defined by TERT copy numbers (bottom: 
left, two copies; middle, three or four copies; right, more than four copies). 
TERT expression levels in relation to TERT copy numbers in TERT rearranged 
cases (top). f, Relative average gene expression levels at the TERT locus. 
Subgroups of tumours with TERT rearrangements (yellow), MYCN amplifi- 
cation (red), high-risk tumours without the aforementioned aberrations (grey), 
and low-risk tumours (green) are compared with average expression in a 
large cohort of neuroblastoma samples (n = 498). 


mutations and chromothripsis were found across both risk groups 
(Fig. 1c). Furthermore, TERT rearrangements, ATRX alterations, 
and MYCN amplifications occurred in a mutually exclusive fashion 
within the high-risk group (P = 0.008), suggesting that they may con- 
verge on similar effector functions. 

To further clarify the clinical relevance of TERT rearrangements in 
a large cohort of patients, we examined 161 additional primary 
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Figure 3 | Translocation of active enhancers 
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neuroblastomas by fluorescence in situ hybridization (FISH) and 
targeted sequencing, and found 16 additional TERT-rearranged cases. 
In the entire set (n = 217, Extended Data Table la), we detected 28 
TERT rearrangements (13%), 27 of which occurred in high-risk 
tumours (Fig. 2a and Extended Data Table 1b). The remaining case 
was classified as intermediate risk (stage 3, 8 years at diagnosis), and 
the patient is currently in complete remission after surgery and chemo- 
therapy. TERT rearrangements were strongly associated with the 
unfavourable prognostic markers of stage 4 disease and patient age at 
diagnosis older than 18 months, but occurred predominantly in tumours 
without MYCN amplification in the high-risk cohort (Extended Data 
Table 1b). Together, 27 of 114 high-risk neuroblastomas (24%) and 22 of 
65 MYCN-non-amplified high-risk tumours (34%) harboured TERT 
rearrangements, indicating their frequent occurrence in these groups 
(Fig. 2a). Patients whose tumours harboured TERT rearrangements 
had a poor clinical outcome, which was similar to that of patients with 
MYCN-amplified tumours and significantly worse than that of high-risk 
patients without MYCN amplification or TERT rearrangements (overall 
survival, P = 0.056; Fig. 2b; event-free survival, P = 0.038; Extended 
Data Fig. 3a). The clinical relevance of TERT alterations was substan- 
tiated by multivariable analyses, in which TERT rearrangements pre- 
dicted unfavourable outcome independently of the established variables 
stage and MYCN (Fig. 2c and Extended Data Fig. 3b). 

Since MYCN is a known transcriptional activator of TERT’, we 
compared TERT expression in tumours with 5p15.33 rearrangements 
with that in MYCN-amplified specimens and with the remaining cases. 
TERT expression was significantly higher in neuroblastomas with 
rearrangements than in MYCN-amplified tumours (P = 0.028) and 
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in those without these alterations (P< 0.001, Fig. 2d and Extended 
Data Fig. 3c). The median TERT expression was 92-fold higher in 
TERT-rearranged tumours than in low-risk tumours (Extended Data 
Fig. 3d). In addition, TERT expression was higher in MYCN-amplified 
tumours than in the remaining samples (P = 0.021, Fig. 2d). 

Since the region upstream of TERT consists of condensed chromatin 
in most somatic cells, which keeps TERT in a silenced state’’, we 
hypothesized that the rearrangements may induce TERT transcrip- 
tional upregulation by genomic repositioning, rather than by gain of 
TERT copy numbers. In support of this hypothesis, TERT expression 
was massively increased in TERT-rearranged tumours compared with 
low-risk tumours in both subgroups with and without additional TERT 
copies (Fig. 2e). In addition, TERT copy numbers and messenger RNA 
(mRNA) levels did not correlate in TERT-rearranged neuroblastomas 
(Fig. 2e), indicating that additional TERT copies have little influence on 
TERT expression in these tumours. In line with that notion, we found 
evidence for mono-allelic TERT expression in five evaluable tumours, 
suggesting that only the rearranged allele is expressed in these cases 
(Extended Data Fig. 3e). Additionally, we found no evidence for the 
existence of tandem duplications affecting TERT without the presence 
of complex rearrangements (Fig. 1b). Together, these findings suggest 
that remodelling of the genomic context by the rearrangement itself— 
rather than copy number gain of this region’*'°—is the major cause of 
TERT mRNA upregulation in TERT-rearranged neuroblastomas. 

Wealso noticed that the distance of the breakpoints to the transcrip- 
tional start site of TERT did not affect TERT expression (Extended Data 
Fig. 3f). Furthermore, identical rearrangements and elevated TERT 
expression were detected in four tumours biopsied at initial diagnosis 
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and relapse (Extended Data Fig. 4), supporting the notion that 5p15.33 
rearrangements and consecutive upregulation of TERT are stable gen- 
etic events during neuroblastoma development and progression. 

We observed that in TERT-rearranged neuroblastomas three genes 
(bleyplaby (AceView-annotated human gene; http://www.aceview.org), 
SLC6A18, and SLC6A19) neighbouring TERT on the distal side of the 
breakpoint showed markedly increased expression, while expression of 
suweeby, blorplaby, and CLPTMIL (cleft lip and palate transmembrane 
protein 1-like protein), located on the proximal side of the breakpoint, 
did not follow this pattern (Fig. 2f and Extended Data Fig. 5a). By 
contrast, MYCN-amplified tumours exhibited upregulation of only 
TERT and bleyplaby, but lacked upregulation of SLC6A18 and 
SLC6A19, whereas all four genes were silenced in low-risk neuroblasto- 
mas and in high-risk tumours without TERT rearrangements or MYCN 
amplification (Fig. 2f and Extended Data Fig. 5a, b). Our findings were 
substantiated by analyses of differential gene expression of tumours 
bearing TERT rearrangements or MYCN amplification versus low-risk 
tumours: in both comparisons, TERT was among the most highly upre- 
gulated genes, while SLC6A18 and SLC6A19 were upregulated in TERT- 
rearranged tumours only (Extended Data Fig. 6a, b). Thus, genomic 
rearrangements at 5p15.33 abrogate silencing of gene expression in this 
chromosomal region, whereas amplified MYCN selectively upregulates 
TERT by transcriptional activation. In line with this notion, we found 
that TERT was the most strongly downregulated gene upon short- 
interfering-RNA-mediated MYCN knockdown in a MYCN-amplified 
neuroblastoma cell line (Extended Data Fig. 6c). Together, these obser- 
vations suggest that both 5p15.33 rearrangements and MYCN amp- 
lification converge on TERT activation in high-risk neuroblastoma. 

The massive upregulation of TERT expression observed in tumours 
bearing structural rearrangements of the TERT locus suggested that 
translocation of regulatory elements (for example, enhancers) might 
be responsible for TERT activation. We therefore examined patterns of 
H3K27ac and H3K4mel histone modifications, which are known to 
mark active enhancers'®”’, in three TERT-rearranged tumours using 
chromatin immunoprecipitation coupled to sequencing (ChIP-seq). 
In all three tumours, we detected peak signals in the translocated 
regions that predict the presence of multiple enhancer clusters imme- 
diately adjacent to the breakpoints (Fig. 3a). Ranking of H3K27ac peak 
signals revealed that rearrangements consistently juxtapose TERT to 
strong enhancer elements, several of which are compatible with the 
recent definition of super-enhancers (Fig. 3b)’*. 

To examine the chromatin context of the TERT locus further, we 
analysed additional histone modifications, known to mark active 
promoters (H3K4me3), transcription elongation (H3K36me3), DNA 
methylation (H3K9me3), and Polycomb repressive complex 2 (PRC2)- 
related gene silencing (H3K27me3), in neuroblastoma cells with and 
without TERT rearrangements (Fig. 3c and Extended Data Figs 7 and 
8a). While histone modifications indicative of an active promoter 
(H3K4me3 and H3K27ac) and transcription elongation (H3K36me3) 
were spread to the TERT locus in TERT-rearranged cases, these marks 
were completely lacking in tumours without TERT rearrangements 
(Fig. 3c). By contrast, the promoter and gene body of TERT were 
broadly marked by the repressive mark H3K27me3 in cells lacking 
TERT alterations (Extended Data Fig. 8a). 

DNA methylation analyses of 39 primary neuroblastomas revealed 
a consistent increase in CpG methylation across the TERT locus in 
rearranged and MYCN-amplified tumours (Extended Data Fig. 8b). 
The strongest differential methylation was detected in a CpG site 
located in close proximity to the core promotor of TERT (Fig. 3d 
and Extended Data Fig. 8b), which has been previously associated with 
a disabled repressive element and elevated TERT expression’. Thus, 
structural rearrangements occurring at 5p 15.33 lead to juxtaposition of 
the TERT locus to strong enhancer elements, resulting in a massive 
epigenetic remodelling of the affected region. 

Our findings were further supported by analyses of cell lines derived 
from high-risk neuroblastoma bearing TERT rearrangements, MYCN 
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amplification, or none of these alterations (Extended Data Fig. 7). Cell 
lines with TERT translocations or MYCN amplification showed ele- 
vated TERT mRNA expression compared with cell lines lacking such 
alterations (Fig. 4a). High levels of TERT expression were paralleled by 
a significant increase in enzymatic telomerase activity as assessed by a 
telomeric repeat amplification assay (Fig. 4b). 

ATRX mutations occurred exclusively in tumours lacking MYCN 
amplification and TERT rearrangements (Fig. 1c) and have previously 
been associated with alternative lengthening of telomeres (ALT)”. 
Consistent with the hypothesis that high-risk neuroblastomas may be 
associated with ALT in the absence of TERT activation through MYCN 
amplification or TERT rearrangement, the ATRX-mutant high-risk 
neuroblastoma cell line (CHLA-90) has evidence of ALT pathway 
activation”. Furthermore, we observed activation of the ALT pathway 
in two high-risk neuroblastoma cell lines without TERT or MYCN 
alterations (Extended Data Fig. 9)?’. In line with these and published 
observations’, we found that ATRX-mutated primary neuroblastomas 
and other high-risk tumours lacking TERT or MYCN alterations had 
abundant telomere repeat sequences, pointing towards activation of 
ALT” (Figs 1c and 4c). By contrast, telomeres were short in primary 
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Figure 4 | Telomerase activity is associated with TERT rearrangements 
and MYCN amplification, while ALT occurs in high-risk neuroblastoma 
lacking these alterations. a, b, TERT expression levels as determined by 
transcriptome sequencing (a) and telomerase activity (b) as determined 

by telomeric repeat amplification assay in neuroblastoma cell lines bearing 
TERT rearrangements (GI-ME-N and CLB-GA), MYCN amplification 
(SK-N-BE(2)C and IMR-5/75), and cell lines without these alterations (LAN-6 
and SK-N-FI). ¢, Distribution of telomere length ratios between the tumours 
and matched normals (computed from whole-genome sequencing) in primary 
neuroblastoma subgroups defined by TERT, MYCN, and ATRX alterations 
and risk group (HR, high-risk without the aforementioned alterations; LR, low- 
risk). d, Telomere FISH analyses of two TERT-rearranged (NBL41, NBL37) 
and two ATRX-mutated (NBL08, NBL04) primary tumours. e, A revised model 
for neuroblastoma pathogenesis based on recurrent genomic alterations, the 
presence or absence of telomere maintenance pathways, and clinical courses 
of the disease (modified after ref. 24). In this model, high-risk tumours are 
distinguished from low-risk tumours by active mechanisms of telomere 
lengthening. The most aggressive neuroblastomas are defined by telomerase 
activation as a result of either TERT rearrangement (TERT) or MYCN 
amplification (MNA). In addition, near-diploid (27) or near-tetraploid (4n) 
karyotypes are preferentially observed in high-risk tumours, while near-triploid 
karyotypes are mostly found in low-risk tumours”. Overall survival of 
patient subgroups at 5 years: 0.51 + 0.08 (MNA/TERT) versus 0.79 + 0.08 
(high-risk tumours) versus 0.98 + 0.02 (low-risk tumours). 
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tumours in which telomerase was activated”’. We confirmed this result 
by telomere FISH analysis in primary tumours bearing either ATRX 
mutations or TERT rearrangements (Fig. 4d). In low-risk tumours, the 
telomere lengths were similar to matched normal samples, indicating 
that telomere maintenance mechanisms may be inactive in these 
tumours (Fig. 4c). Together, these findings support the notion that high 
TERT expression mediated by 5p15.33 rearrangements or MYCN amp- 
lification induces constitutive catalytic telomerase activity, whereas 
ALT is activated in high-risk neuroblastoma lacking these alterations. 
Thus, our results suggest that high-risk neuroblastoma is defined mole- 
cularly by mechanisms leading to telomere lengthening. 

In summary, we discovered recurrent TERT rearrangements in 
approximately one-quarter of high-risk neuroblastomas. Our results 
indicate that most high-risk tumours are affected by TERT rearrange- 
ments, MYCN amplification, or ATRX mutations, all of which funnel 
into telomere lengthening, thus providing a molecular, mechanistic 
definition of this neuroblastoma subtype (Fig. 4e). By contrast, low- 
risk tumours are characterized by the absence of such alterations and 
low TERT expression levels, presumably leading to the inability to gain 
immortal proliferation capacity. In our data set, the most aggressive 
subtype of neuroblastoma was defined by telomerase activation as a 
result of either TERT rearrangement or MYCN amplification (Fig. 4e). 
With the further development of telomerase inhibitors, our finding 
might point to a novel therapeutic option for the most aggressive 
subgroup of this deadly paediatric disease. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Patients and tumour samples. Overall, this project comprised tumour samples 
from 217 German patients with neuroblastoma (Extended Data Table 1a). Patients 
were diagnosed between 1991 and 2014 and were registered and treated according 
to several clinical trials of the Gesellschaft fiir Padiatrische Onkologie und 
Hamatologie. The trials were approved by the Ethics Committee of the Medical 
Faculty, University of Cologne. The Institutional Review Board approved collec- 
tion and use of all specimens in this study. Informed consent was obtained from all 
patients. The MYCN gene copy number was determined as a routine diagnostic 
method using FISH analysis. DNA and total RNA was isolated from tumour 
samples with at least 60% tumour cell content as evaluated by a pathologist. 
TERT rearrangements were established as a novel molecular marker in a dis- 
covery cohort of 56 patients. In this set, TERT rearrangements (n = 12) occurred 
exclusively in high-risk patients (n = 39). We sought to validate this finding in a 
larger, representative neuroblastoma cohort, comprising approximately 40% high- 
risk patients. Allowing for a potential occurrence of TERT rearrangements in up to 
10% of non-high-risk patients and ensuring a statistical power of 80%, we esti- 
mated that at least 75 non-high-risk patients were required for validation. We 
therefore investigated 161 additional tumours derived from 86 non-high-risk and 
75 high-risk patients (including 39 and 36 tumours with and without MYCN 
amplification, respectively). The experiments were not randomized. The investi- 
gators were not blinded to allocation during experiments and outcome assessment. 
Cell lines. Neuroblastoma cell lines LAN-6, GI-ME-N, as well as SK-N-FI were 
directly purchased from the Deutsche Sammlung von Mikroorganismen und 
Zellkulturen (DSMZ, Braunschweig, Germany) or American Type Culture 
Collection (ATCC/LGC Standards, Molsheim Cedex, France), respectively. 
Furthermore, SK-N-BE(2)C, IMR-5/75, and CLB-GA were provided by the 
laboratories of H. Deubzer, F. Westermann and J. H. Schulte, respectively. All cell 
lines not directly purchased from ATCC or DSMZ were authenticated by STR 
profiling at the DSMZ. IMR-5/75, SK-N-BE(2)C, GI-ME-N, and CLB-GA were 
grown in RPMI1640 with 10% FBS; LAN-6 and SK-N-FI were cultured in DMEM 
with 20% FBS. All cell lines were cultured without antibiotics and routinely tested 
negative for mycoplasma. 
Whole-genome sequencing. DNA was extracted from fresh-frozen tumour tissue 
and the corresponding matched normal of 56 patients using the Puregene Core Kit 
A (Qiagen) and NucleoSpin Blood DNA extraction kit (Macherey-Nagel), respect- 
ively, according to the manufacturers’ instructions. DNA was eluted in 1 X TE 
buffer, diluted to a working concentration of 100 ng ,l~’ and stored at —80°C. 
Libraries were prepared with the TruSeq DNA PCR-free sample preparation kit 
(Illumina) followed by size selection using SPRI beads (Beckman Coulter 
Genomics). The final libraries were then sequenced on an Illumina HiSeq 2000 
instrument with a paired-end read length of 2 X 100 nucleotides. 
Whole-genome sequencing data analysis. Raw sequencing reads were aligned to 
the human genome (NCBI build 37/hg19) by using BWA (version 0.6.1-1104; 
https://github.com/lh3/bwa). Possible PCR duplicates were then masked in the 
resulting alignment files by searching for concordant read pairs. Next, somatic 
mutations were called using a further development of our in-house mutation 
caller. The major improvements to previous versions” of the caller were that 
identified variants were filtered against a library of more than 500 normals (mixed 
whole-exome and whole-genome) and that the error model contained contribu- 
tions of human library contamination which were estimated from the sequencing 
data. With these modifications we were able to gain a larger sensitivity and spe- 
cificity (data not shown). Rearrangements as well as copy number changes were 
analysed as described previously’. Patterns of recurrent genomic rearrangements 
were identified by scanning the genomes for breakpoint clusters that occur within 
100 kb regions in a similar approach to that described in ref. 7. To compute the 
telomere ratio from whole-genome sequencing data, raw sequencing reads con- 
taining the telomere repeat sequence (TTAGGG), or its reverse complement were 
counted’, and the ratio between the tumour and matched normal was determined. 
This ratio was then normalized to the absolute amount of sequenced DNA using 
the total amount of reads from the tumour and the normal. 
Targeted sequencing. In the validation cohort, hybrid-capture-based target 
enrichment followed by massively parallel sequencing of the genomic region 
encompassing TERT and CLPTMIL was used to detect TERT rearrangements. 
Alignment and the detection of genomic rearrangements were performed analog- 
ous to whole-genome sequencing. We reported only those TERT rearrangements 
that had been detected both by FISH analysis and by targeted sequencing. 
RNA sequencing. RNA sequencing and gene expression analysis was performed 
as described previously”®. Briefly, a Dynabeads mRNA Purification Kit 
(Invitrogen) was used to purify mRNA from total RNA. Library construction 
was performed according to the standard TruSeq protocol. Clusters were gener- 
ated according to the TruSeq PE cluster Kit version 3 reagent preparation 
guide (for cBot-HiSeq/HiScanSQ). High-throughput shotgun sequencing was 
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performed on the IlluminaHiSeq 2000 platform. Paired-end reads with lengths 
of 100 nucleotides were generated. Raw data processing, read mapping, and gene 
expression quantification were done using the Magic-AceView analysis pipeline as 
described’. The Magic analysis tool is accessible at ftp://ftp.ncbi.nlm.nih.gov/ 
repository/acedb/Software/Magic; AceView served as primary transcriptome 
reference (http://www.aceview.org). RNA sequencing was also used to identify 
MYCN-regulated genes in IMR5/75 cells expressing MYCN shRNA under the 
control of the tet repressor. Briefly, for IMR5/75 shRNA inducible cells, 1 ug ml! 
tetracycline or the equivalent of volume of 70% ethanol was added to the cells, and 
cells were incubated for 24 h and then harvested for RNA extraction using RNeasy 
mini kit (Qiagen). Five micrograms of RNA from each sample was processed using 
the RiboGold kit (Epicentre) to remove rRNA from samples to increase reads from 
mRNA. The concentration of the resulting RNA was measured using the Qubit 
RNA assay (Life Technologies). One microgram of RNA was then used to prepare 
libraries for sequencing using the NEB Ultra directional RNA library prep lit for 
Illumina (New England Bioscience) according to the manufacturer’s instructions. 
Microarray. Single-colour gene expression profiles were generated using custo- 
mized 4X44 K oligonucleotide microarrays produced by Agilent Technologies. 
Labelling and hybridization was performed following the manufacturer’s protocol. 
Microarray expression profiles were generated using Agilent’s Feature Extraction 
software (version 9.5.1). Data were normalized using quantile normalization. 
Validation of TERT rearrangements by dideoxy sequencing. Rearrangements 
of the TERT locus were validated by dideoxy sequencing in both diagnostic and 
relapsed tumour samples. Dideoxy sequencing was performed by Seqlab. 

FISH. BAC clones CTD-2191M2 (to detect the region proximal of CLPTM1L) and 
CTD-2511M20 (to detect the TERT/SLC6A18/SLC6A19 loci) were labelled with 
digoxigenin and biotin, respectively (see also Fig. 3a). Cell line cytospin prepara- 
tions were pre-treated with 2 X SSC solution at 37 °C for 30 min, digested with 
Digest-All III (dilution 1:2, Invitrogen) at 37 °C for 6 min, fixed in 4% formalde- 
hyde, and subsequently dehydrated in a graded ethanol series. FISH probes and 
human Cot-1 DNA (Life Technologies) in hybridization buffer (50% formamide, 
10% dextran sulfate sodium, in 2 X SSC) were co-denatured at 85°C for 4min 
and hybridized overnight at 37°C. Post-hybridization washing was done 
with 0.5 X SSC at 75°C for 5 min, followed by washes in PBS, a blocking step 
with CAS-block (Life Technologies, with 10% normal goat serum in PBS) and 
a 1h post-incubation with streptavidin-Alexa-555 conjugates (1:500, Life 
Technologies) and anti-digoxigenin-FITC (1:500, Roche), to enable fluorescence 
detection. After three subsequent washes in PBS, samples were mounted with 
VectaShield mounting media containing 4',6-diamidino-2-phenylindole dihy- 
drochloride (DAPI; Vectorlabs). Images were acquired using an Olympus 
Fluoview FV10 scanning confocal microscope system. 

Telomeric repeat amplification protocol assay. Telomerase activity in cell lines 
was determined with a PCR-based telomeric repeat amplification protocol 
(TRAP) enzyme-linked immunosorbent assay (ELISA) kit (TeloTAGGG 
Telomerase PCR ELISA?!YS, Roche) according to the manufacturer’s protocol. 
ChIP-seq analysis of histone modifications. Formaldehyde cross-linking of cells, 
cell lysis, sonication, ChIP procedure, and library preparation were performed as 
described previously”, starting with approximately 4 10° cells (1 X 10° cells per 
individual immunoprecipitation). Direct cell lysis for each sample was achieved by 
incubation for 30 min in 950 pl RIPA I on ice (10 mM Tris-HCl pH 8.0, 1 mM 
EDTA pH 8.0, 140mM NaCl, 0.2% SDS, 0.1% DOC). Tissue disruption, form- 
aldehyde fixation, and sonication of tumour material were done according to a 
previously published protocol”. Approximately 30 mg of fresh-frozen tumour 
tissue was used per individual ChIP-seq experiment. All subsequent steps were 
performed analogous to cell line experiments. The Bioruptor Plus sonication 
device (Diagenode) was used for high intensity sonication for 30-60 min each 
with intervals of 30s on and 30s off. Library preparation was performed using 
the NEBNext Ultra DNA Library Prep Kit (New England Biolabs) according to the 
manufacturer’s protocol. Samples were mixed in equal molar ratios and sequenced 
on an Illumina sequencing platform. 

ChIP-seq data analysis. Single-end reads were aligned to the hg19 genome using 
Bowtie2 (version 2.1.0). Only uniquely aligned reads were kept. BAM files of 
aligned reads were further processed using the deepTools suite (https://github. 
com/fidelram/deepTools). Input files were subtracted from the treatment files 
using the bamCompare tool, applying the SES method for normalization of signal 
to noise. Resulting signals were normalized to an average 1 X coverage to produce 
signal (bigWig) files. Peaks were called using the MACS 1.4 tool using default 
parameters. 

DNA methylation profiling. DNA was isolated from snap-frozen neuroblastoma 
tissue. Genome-wide DNA methylation was assessed using an Infinium 
HumanMethylation450 BeadChip (Illumina) according to the manufacturer’s 
instructions. Probes were removed on the basis of the following criteria: (1) pro- 
portion of non-detectable f values >0.3 (n =379), (2) single nucleotide 
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polymorphism at or near the targeted CpG site according to R-Forge package IMA 
(https://rforge.net/IMA/, n = 92,600), (3) control probes (n = 65), and (4) map- 
ping to the X or Y chromosome (n = 10,351). Together, 382,182 probes were kept 
for further analysis. The k-nearest neighbours method was used to impute missing 
values, and a subset quantile normalization was applied. TERT-related CpGs were 
annotated using the assignGenomeAnnotation program of the HOMER tool suite 
(http://homer.salk.edu/homer). 

Telomere restriction-fragment analysis and C-circle assay. Both assays have 
been performed as described previously’. Briefly, genomic DNA was purified and 
digested with Alul and Mbol. For restriction-fragment analysis, 10 yg of digested 
DNA was electrophoresed on a 0.8% TBE-agarose gel. Subsequently, telomeric 
DNA was detected by Southern blotting using a [**P]dATP end-labeled 
(CCCTAA), oligonucleotide probe. For the C-circle assay, DNA samples 
(7.5 ng, 101) diluted in ultraclean water were combined with 10 il BSA (NEB; 
0.2 mg ml), 0.1% Tween, 0.2 mM each dATP, dGTP, dTTP, and 1 X ©29 Buffer 
(NEB) in the presence or absence of 7.5 U DDNA polymerase (NEB), incubated at 
30°C for 8h and then at 65°C for 20 min. Reaction products were diluted to 
100 pl with 2 x SSC and dot-blotted onto a 2 X SSC-soaked nylon membrane. 
DNA was ultraviolet cross-linked onto the membrane and hybridized with a 
>2p_end-labelled (CCCTAA), oligonucleotide probe to detect C-circle amplifica- 
tion products. All blots were washed, exposed to PhosphoImager screens, scanned, 
and quantified using a Typhoon 9400 Phospholmager (Amersham, GE 
Healthcare). Genomic DNA from ALT-positive (U2OS) cells served as positive 
control and reference for the quantification of C-circles detected in other cell lines. 
Immunofluorescence-FISH for APBs. ALT-associated promyelocytic leukaemia 
(PML) bodies were visualized by a combination of immunofluorescence with an 
anti-PML antibody and FISH using Alexa-488-(TTAGGG)n PNA probes as 
described previously”’. 

Telomere FISH on tumour sections. Tumours were sliced into 4 1m sections, 
paraffin fixed, embedded in formalin, and mounted onto positively charged glass 
microscope slides. Mounted sections were incubated for 30 min at 55 °C, washed 
three times for 5 min in xylene, rinsed in successive 100%, 95%, and 70% ethanol 
baths, and washed in double-distilled HO and 1% Tween before being placed in 
antigen unmasking solution in a boiling kitchen steam for 30 min. Next, slides 
were rinsed in double-distilled H2O and dehydrated in successive ethanol washes 
of 70%, 95%, and 100%. Slides were incubated at 72 °C for 10 min with an Alexa- 
488 telomeric-C PNA probe and hybridized overnight in a dark humidity cham- 
ber. Slides were washed with PNA wash buffer and PBST and incubated for 10 min 
in DAPI solution. After washing in double-distilled HO, slides were mounted 
with prolong anti-fade mounting medium. Images were taken on a Nikon 90i 
fluorescent light microscope at X63 resolution. Full z-stacks were taken at 0.5 um 
and projected and focused using Elements software. 

Statistical analyses. SPSS (package release 20.0.0, IBM Armonk), R (version 
3.1.2), and GraphPad Prism (version 6.05 GraphPad Software) were applied for 
statistical analyses and data presentation. Overall survival was calculated as the 
time from diagnosis to death from disease or the last follow-up if the patient 
survived. Event-free survival was calculated from diagnosis to the time of tumour 
progression, relapse, or death from disease, or to the last follow-up if no event 
occurred. Survival curves were estimated according to Kaplan-Meier and com- 
pared with the log-rank test (R survival package version 2.15.0). Associations of 
genomic alterations with clinical risk factors were examined using Fisher’s exact 


test. Multivariable Cox regression models were used to analyse the simultaneous 
prognostic impact of TERT rearrangements and established clinical markers (stage 
(1-3, 4S versus 4), MYCN (non-amplified versus amplified), and age (<18 months 
versus >18 months)) on overall survival and event-free survival. Since TERT 
rearrangements were observed only in patients aged >18 months in this study, 
multivariable model building was restricted to this cohort (n = 125) and the vari- 
ables TERT status, stage, and MYCN status. First, the proportional hazard assump- 
tion was assessed for each predictor one-at-a-time using the goodness-of-fit test of 
ref. 30 showing no deviation from the proportional hazard assumption. The 
proportional hazard assumption was considered valid whenever the P value of 
the goodness-of-fit test was >0.05. In addition, predicted survival curves under the 
Cox model were compared with the Kaplan-Meier estimates for each predictor 
supporting adequateness of model fit. Multivariable models were then built using a 
backwards selection procedure including the variables TERT status, stage, and 
MYCN status (inclusion criterion, P value of the score test <0.05; exclusion 
criterion, P value of the likelihood ratio test >0.1). The variables identified at this 
step formed the model of main effects. Finally, the factors selected in the model of 
main effects were fitted with all pairwise interactions in a second block by a 
stepwise forward selection (inclusion criterion, P value of the score test = 0.05; 
exclusion criterion, P value of the likelihood ratio test >0.1), resulting in the final 
model. For the final model, the proportional hazard assumption was assessed 
using the goodness-of-fit test of ref. 30 as well as by fitting extended Cox models 
including the prognostic factors from the final model in a first block and the 
product terms of the prognostic factors with some function of time g(t) in a second 
block with stepwise forward selection in the second block (inclusion criterion, 
P value of the score test <0.05; exclusion criterion, P value of the likelihood ratio 
test >0.1). Choices for g(t) were g(t) = t and g(t) = log(t) with t denoting survival 
time. The proportional hazard assumption was considered as valid if no time- 
dependent factor was selected in any of the extended Cox models and if, addition- 
ally, any P value of goodness-of-fit test was >0.05. 

Analyses of TERT expression levels and methylation between subgroups were 
investigated by Mann-Whitney tests and corrected for multiple hypotheses testing 
using a Bonferroni correction. To test for mutual exclusivity between TERT rear- 
rangements (TERT), MYCN amplifications (MNA), and ATRX mutations 
(ATRX) in the high-risk group, Fisher’s exact tests were performed between every 
alteration and the combination of the remaining two alterations (TERT versus 
MNA and ATRX; MNA versus TERT and ATRX; ATRX versus TERT and MNA). 
The largest P value was finally reported. 
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Extended Data Figure 1 | Validation of rearrangements of the TERT 
locus by dideoxy-sequencing. Sequencing chromatograms of the breakpoint 
regions of 5p15.33 rearrangements along with their genomic coordinates 
(hg19), and the breakpoint-spanning nucleotide sequences. The sequence 


mapping to the TERT locus is indicated in yellow, the rearrangement partner is 
indicated in grey; nucleotides inserted at the breakpoint region are indicated 
in white. 
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Extended Data Figure 2 | Schematic representation of chromothripsis in showing chromothripsis of chromosomes 17 and 20 in the tumours NBL21 
four primary neuroblastomas. a, Circos plots showing chromothripsis of and NBLO8, respectively. Regions showing loss of heterozygosity (LOH) 
chromosome 5 in the tumours NBL39 and NBL37. b, Circos plots are indicated by black segments. 
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Extended Data Figure 3 | TERT rearrangements are associated with poor 
event-free survival of patients, and with TERT upregulation independent of 
the breakpoint distance from the TERT transcriptional start site. a, Kaplan— 
Meier estimates for event-free survival of neuroblastoma patient groups 
defined by TERT rearrangements (TERT), MYCN amplification (MNA), high- 
risk disease without these alterations (HR), and low-risk or intermediate-risk 
disease (nonHR). Patients with tumours bearing both a TERT rearrangement 
and MYCN amplification (n = 5) were excluded. Event-free survival at 5 years: 
0.17 = 0.09 (TERT) versus 0.38 + 0.09 (MNA) versus 0.43 + 0.09 (HR) versus 
0.83 + 0.05 (nonHR). b, Multivariable Cox regression analysis of the potential 
prognostic factors stage, MYCN status, and TERT status for event-free survival 
in patients aged >18 months (n = 125).¢, Validation of TERT expression levels 
by microarrays in the four neuroblastoma subgroups indicated above. Sample 
numbers are given at the bottom. **P < 0.01, ***P < 0.001. d, Fold-change of 


median TERT expression levels in TERT-rearranged and MYCN-amplified 
tumours compared with low-risk (LR) neuroblastomas as measured by 
transcriptome sequencing. e, Evidence for monoallelic expression of TERT in 
five tumours bearing TERT rearrangements. The presence of a heterozygous 
single nucleotide polymorphism and its allelic fraction measured by whole- 
genome sequencing (WGS) is shown on the left of each panel; monoallelic 
expression as established by transcriptome sequencing (RNA-seq) is indicated 
on the right. The genomic position of the single nucleotide polymorphism is 
indicated at the top, the number of reads available for the analysis is shown at 
the bottom. f, TERT expression measured by transcriptome sequencing in 
relation to the distance of the rearrangement breakpoint from the TERT 
transcriptional start site (TSS). TERT expression levels and breakpoint 
distances from the TERT transcriptional start site were not correlated (r = 0, 
P= 0.97; Spearman’s rank correlation test). 
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Extended Data Figure 4 | TERT rearrangements are maintained in relapsed 
neuroblastoma. a, Agarose gel electrophoresis of PCR products representing 
individual TERT rearrangements in four tumours at initial diagnosis (1), 

and at relapse (R). The non-template controls are indicated by C. b, TERT 
expression measured by microarrays in relapsed TERT-rearranged tumours 
(n = 3) compared with TERT-rearranged tumours biopsied at initial diagnosis 
(n = 10). c, Sequencing chromatograms of the breakpoint regions for the 
relapse cases. 


©2015 Macmillan Publishers Limited. All rights reserved 


a 
rarjy (1,178,719 - 1,172,640) SLC6A19 (1,201,710 - 1,225,230) 
20 20 
: ag P = 
8 15 Ns 8154 7 
a tc a 
c c 
2 xe) 
3 3 
210 —_ 3 10 
5 5 
Neeee tle 
. == SS a 
TERT. MNA HR LR TERT MNA HR 
(n = 10) (n= 9) (n = 18) (n = 17) (n = 10) (n=9) (n= 18) (n= = R27) 
SLC6A18 (1,225,470 - 1,246,304) bleyplaby (1,252,249 - 1,255,997) 
20 
2 2 
8 815 
8 3 
§ § 
3 3 
g B10 
a Qa 
8 & 
5 
TERT MNA HR LR TERT MNA_— HR LR 
(n= 10) (n= 9) (n= 18) (n= 17) (n= 10) (n= 9) (n= 18) (n= 17) 
TERT (1,253,287 - 1,295,162) suweeby (1,327,756 - 1,328,222 ) 
20 
2 ‘i gs 
§ 15) 3 
5 ie 5 
3 3 
8 10 ait d 
Qa Qa 
3 3 
TERT NA TERT MNA HR LR 
(n = 10) me = 9) fe Rig) (ne 17) (n = 10) (n= 9) (n= 18) (n= 17) 
blorplaby (1,328,032 - 1,328,566) CLPTM1L (1,318,000 - 1,345,002) 
20 20 
2 Ns 2 
8 15 43 31s) as Se 
§ § ee 
OG GO NS 
Ff] ES > SS in : 
Qa Qa 
3 3 
5 5 
TERT. MNA HR LR TERT. MNA HR LR 
(n = 10) he = 9) (we = Re) fae = B47) (n= 10) (n= 9) (n= 18) (n = 17) 
b 
breakpoint regior 
© 
fo} 
oO 
2) 
Cc 
S 
no 
on 
2 
a 
x 
o 


T T T 
1,000,000 1,250,000 1,500,000 
chr5 position [bp] 


TERT [10] — MNA [9] 


—HR[18] —LR [17] 


Extended Data Figure 5 | Regional effects of TERT rearrangements on gene 
expression patterns. a, Expression levels of genes around TERT measured 
by transcriptome sequencing are shown for tumours with TERT rearrange- 


ments (TERT, yellow), tumours with MYCN amplifications (MNA, red), and 
tumours without these aberrations classified as either high-risk (HR, grey) or 
low-risk (LR, green). Five consecutive genes (TERT, bleyplaby, SLC6A18, 
SLC6A19, and rarjy) located distal of the breakpoint at chromosome 5p15.33, 
and three genes (suweeby, blorplaby, and CLPTMI1L) located proximal of 

the breakpoint are shown. Genomic positions of the genes are indicated 

at the top, and sample sizes are indicated at the bottom. NS, not significant, 
*P< 0.05, **P < 0.01, ***P < 0.001. b, Averaged expression levels of 

genes measured by transcriptome sequencing are indicated for an ~500 kb 
region centring around TERT. Colour codes of neuroblastoma subgroups as 
indicated above. The breakpoint region is indicated in beige. 
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Extended Data Figure 6 | TERT mRNA levels are massively upregulated in 
TERT-rearranged and MYCN-amplified neuroblastomas. a, Differential 
gene expression between TERT-rearranged (n = 10) and low-risk (n = 17) 
neuroblastomas and b, between MYCN-amplified (MNA, n = 92) and low-risk 
tumours (n = 238) measured by transcriptome sequencing”. Negative log(P 
values) are plotted against log,(fold changes) of the genes expressed in 

the respective subgroups. Genes of interest are indicated at the bottom. 
log,(fold changes) correspond to the log difference of established gene 
expression scores. P values were established using Student's t-tests and applying 
a Benjamini and Hochberg false discovery rate multiple testing correction. 


c, Analysis of differential gene expression of a MYCN-amplified neuroblastoma 
cell line (IMR-5/75) before and after shRNA-mediated knockdown of MYCN 
using transcriptome sequencing. The number of genes (frequency) up- or 
downregulated upon MYCN knockdown is plotted against the fold change of 
regulation. TERT is the most strongly downregulated gene upon shRNA- 
mediated MYCN knockdown. Only differentially expressed genes (t-test, 
false discovery rate controlled with P < 0.01) are shown. The insert shows 
reduced MYCN protein levels after induction of a MYCN-specific shRNA in 
IMR-5/75 cells as well as B-actin protein levels as control analysed by 
immunoblotting. 
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Extended Data Figure 7 | Detection of TERT rearrangements in MYCN-amplified. Arrows indicate the unaltered status of chromosome 
neuroblastoma cell lines. a, Schematic representation of binding sites of 5p15.33 (red signals in close proximity to green signals); arrowheads indicate a 


probes used for FISH at the TERT locus. b, FISH analysis of neuroblastoma cell rearrangement of 5p15.33 (red signals without adjacent green signals). 
lines with 5p15.33 rearrangements and without these rearrangements; MNA, 
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Extended Data Figure 8 | Patterns of H3K27me3 histone modification Samples are ordered from top to bottom. CpG sites are indicated relative to 
and DNA methylation at the TERT locus. a, Survey of the histone mark their position to the TERT locus. Average methylation levels of each CpG site in 
H3K27me3 in neuroblastomas harbouring TERT rearrangements (cell line the three subgroups are shown at the top. Similar to the highlighted CpG 
GI-ME-N and primary tumour NBL36) and a neuroblastoma cell line lacking __ site cg11625005 upstream of the TERT transcriptional start site (Fig. 3d), CpG 
these alterations (SK-N-FI). b, DNA methylation patterns of CpG sites at sites scattered over the TERT gene body were methylated significantly higher 
the TERT locus in TERT-rearranged (TERT, n = 6) and MYCN-amplified in TERT-rearranged and MYCN-amplified cases than in tumours without 
(MNA, n = 9) primary neuroblastomas, as well as tumours lacking these these alterations (P < 0.001 each). 


alterations (others, n = 24) using HumanMethylation450 microrrays. 
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Extended Data Figure 9 | ALT activity in neuroblastoma cell lines lacking 
TERT rearrangements and MYCN amplifications. a, Detection of 
extrachromosomal telomeric repeat DNA by C-circle assays with genomic 
Alul/Mbol digested DNA from the indicated neuroblastoma cell lines. The 
ALT-positive control cell line, U2OS, is at the left. b, Detection of ALT- 
associated PML bodies in the indicated cell lines. Telomeric TTAGGG FISH 


LETTER 


DAPI 
Tel-FISH 
PML-IF 


(green) and immunofluorescence for PML (red) were combined and DNA was 
counterstained with DAPI (blue). c, Telomere restriction-fragment analysis of 
telomeric DNA from LAN-6, SK-N-FI, and the ALT-positive control U20S. 


Telomeric DNA was detected by Southern blot with a [?*P]dATP end-labelled 
(CCCTAA), oligonucleotide. 
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Extended Data Table 1 | Prognostic characteristics of the neuroblastoma cohort and prevalence of TERT rearrangements 
in patient subgroups 


a 
Prognostic characteristics of the neuroblastoma cohort 
n % of total 
INSS stage 
1 31 14.3% 
2 28 12.9% 
3 25 11.5% 
4 104 47.9% 
4S 29 13.4% 
Age at diagnosis 
< 18 months 92 42.4% 
> 18 months 125 57 .6 % 
MYCN status 
Normal 168 774 % 
Amplified (MNA) 49 22.6 % 
Risk groups 
High -risk [HR] patients 114 52.5 % 
- HR wlio MNA 65 30.0 % 
Non -HR patients 103 475 % 


Prevalence of TERT rearrangements in prognostic neuroblastoma subgroups 


NALL NTERTr odds ratio P-value 

INSS_ stage 

1-3, 4S 413 1 

4 104 27 38.81 <0.001 
Age at diagnosis 

< 18 months 92 0 

> 18 months 125 28 © <0.001 
Risk groups 

Non-HR patients 103 1 

High -risk [HR] patients 114 27 31.33 <0.001 
MYCN status 

Normal 168 23 

Amplified (MNA) 49 5 [0.72] 0.633 
HRw/o MNA 65 22 

HR w/ MNA 49 5 0.22 0.004 
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Thalamic control of sensory selection 


in divided attention 


Ralf D. Wimmer, L. Ian Schmitt!*, Thomas J. Davidson?, Miho Nakajima’, Karl Deisseroth”** & Michael M. Halassa’>* 


How the brain selects appropriate sensory inputs and suppresses 
distractors is unknown. Given the well-established role of the 
prefrontal cortex (PFC) in executive function’, its interactions with 
sensory cortical areas during attention have been hypothesized to 
control sensory selection”°. To test this idea and, more generally, 
dissect the circuits underlying sensory selection, we developed a 
cross-modal divided-attention task in mice that allowed genetic 
access to this cognitive process. By optogenetically perturbing 
PFC function in a temporally precise window, the ability of mice 
to select appropriately between conflicting visual and auditory 
stimuli was diminished. Equivalent sensory thalamocortical 
manipulations showed that behaviour was causally dependent on 
PFC interactions with the sensory thalamus, not sensory cortex. 
Consistent with this notion, we found neurons of the visual thala- 
mic reticular nucleus (visTRN) to exhibit PFC-dependent changes 
in firing rate predictive of the modality selected. visTRN activity 
was causal to performance as confirmed by bidirectional opto- 
genetic manipulations of this subnetwork. Using a combination 
of electrophysiology and intracellular chloride photometry, we 
demonstrated that visTRN dynamically controls visual thalamic 
gain through feedforward inhibition. Our experiments introduce 
a new subcortical model of sensory selection, in which the PFC 
biases thalamic reticular subnetworks to control thalamic sensory 
gain, selecting appropriate inputs for further processing. 

To dissect the circuit basis of sensory selection, we sought a 
behaviour capable of dividing attention across modalities in the freely 
behaving mouse. Building on the rich history of visual neuroscience’, 
we focused our investigations on visual processing under conditions in 
which vision was behaviourally selected or suppressed (Fig. la). As 
such, we developed and validated a two-alternative forced-choice task 
in which mice selected between conflicting visual and auditory stimuli 
on a trial-by-trial basis. Stimuli indicated the location where a mouse 
had to nose poke to obtain a reward. Trial availability and target 
modality were signalled through binaurally emitted noise. For some 
mice, brown noise (10-kHz low-pass-filtered white noise) signalled a 
visual target (Fig. 1b, top) whereas blue noise (11-kHz high-pass- 
filtered white noise) signalled an auditory target (Fig. 1b, bottom). 
Cueing was counterbalanced across mice, with no effect on perform- 
ance noted. By design, the task was asymmetric, with a visual detection 
component (flash from a light-emitting diode (LED) appearing to the 
right or left) and an auditory discrimination component (upsweep, 
10-14 kHz; downsweep, 16-12 kHz). Multiple quality control metrics 
ensured that mice performed this task using the biasing cues (brown 
and blue noise) rather than low-level alternating strategies (Extended 
Data Fig. 1). Performance on the two modalities was balanced (n = 15 
mice, Fig. 1c). 

Comparing visual detection under cross-modal and visual-only con- 
ditions suggested divided attention between vision and audition in the 
cross-modal task (Fig. 1d). Specifically, the visual detection threshold 


was higher in cross-modal trials (n = 4 mice, P < 0.05, bootstrap com- 
parison; see Extended Data Fig. 2 for single-mouse examples and fixed 
lapse rate analysis). This difference persisted even when the conflicting 
auditory stimulus was randomly but systematically removed (Fig. le 
and Extended Data Fig. 3), suggesting that diminished visual perception 
under cross-modal conditions is a result of expectation (top-down) 
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Figure 1 | Cross-modal divided attention in the mouse. a, Hypothesized 
control of visual gain under cross-modal conditions (LGN, lateral geniculate 
nucleus; V1, primary visual cortex). b, Task design. A mouse is simultaneously 
informed about trial availability and the nature of the target stimulus 
through binaurally delivered noise. In this schematic, brown noise denotes 
‘attend to vision’ and blue noise denotes ‘attend to audition’. Following a 
variable anticipation, during which the mouse is required to hold its snout in a 
centrally located poke, conflicting auditory and visual stimuli are presented. 
By design, the task is asymmetric, having a visual detection component 
(presence or absence of light at the reward location) and an auditory 
discrimination component (upsweep, turn left; downsweep, turn right). 

c, Mice exhibited comparable performance on visual and auditory trials 
(mean + s.e.m., n = 15 mice). d, Visual detection performance in cross-modal 
trials compared to visual-only trials (n = 4 mice, =421 trials per condition). 
Note that both detection threshold and peak performance were lower in the 
cross-modal condition. e, Eliminating the auditory distractor in the cross- 
modal condition did not affect the visual detection psychometric function 

(n = 4 mice, =211 trials per condition). f, When mice were not differentially 
cued but instead ignored the auditory stimulus by learning that it was not 
rewarded over a full session (reversal learning), visual detection threshold did 
not change (n = 6 mice, =242 trials per condition). g, Visual detection 
threshold (bootstrap computed) of the pertinent psychometric functions in 
d-f. Error bars in d-g are 95% confidence intervals and therefore non-overlap 
denotes significance of P< 0.05. Aud., auditory; dist., distractor; vis., visual. 
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rather than sensory interference (bottom-up). Conversely, when mice 
selected targets based on reward history, detection threshold was un- 
affected (Fig. 1f, n = 6 mice). Selective suppression of the visual detec- 
tion threshold in cross-modal trials was observed in both the raw 
(Fig. 1d, e) and the fitted (Fig. 1g) data. Together, these findings suggest 
that a cued, trial-by-trial task design is required for investigating sensory 
selection in divided attention. 

Given the known role of PFC in top-down control of sensory pro- 
cessing and that our psychophysical measurements revealed top-down 
engagement in the cross-modal task, we asked whether cross-modal 
performance was PFC dependent. We targeted the prelimbic cortex 
because of its known homology to primate dorsolateral PFC’. We used 
the VGAT-ChR2 mouse to perturb PFC function in a temporally 
precise manner. In this mouse, the light-activated ion channel chan- 
nelrhodopsin-2 (ChR2) is expressed under the vesicular GABA (y- 
aminobutyric acid) transporter promoter (VGAT)'®. Optogenetic 
drive in this mouse is known to exert intensity- and duration- 
dependent inhibition of excitatory neural activity''’’. Using this 
approach, we observed behavioural disruption only when the PFC 
activity was perturbed during stimulus anticipation (Fig. 2a, n = 4 
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Figure 2 | Evidence for top-down thalamic modulation in divided attention. 
a, Disrupting PFC activity by delivering blue laser pulses (50 Hz, 18 ms, 90% 
duty cycle) impaired task performance at 100% stimulus intensity equally 

on both modalities only when manipulation was performed during stimulus 
anticipation (n = 4 mice, *P < 0.05, Wilcoxon rank-sum test). b, The effect 
was related to the cross-modal nature of the task, not its difficulty, as PFC 
inhibition did not affect performance on a visual-only task. c, Disruption of 
primary visual cortex during stimulus presentation impaired performance on 
visual trials (n = 4 mice). d, The effect in c was related to task difficulty, as 
the visual detection threshold increased in a visual-only task. e, The data in 

a and c do not support a causal role for PFC interactions with primary visual 
cortex in performance. f, Perturbing visual thalamic function in a manner 
similar to cortical perturbations in VGAT-ChR2 mice preferentially 
diminished performance on visual trials during both anticipation and presen- 
tation of target stimuli (n = 12 sessions from 3 mice, *P < 0.05, **P<0.01, 
**P < 0.001, Wilcoxon rank-sum test). g, The finding in f supports a 
model in which PFC activity influences thalamic sensory processing. Bar 
graphs represent mean + s.e.m. Error bars for psychometric curves 

are 95% confidence intervals. NS, not significant; «, detection threshold; 

A, lapse rate. 
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mice). This effect was dependent on the cross-modal nature of the 
task, as perturbing PFC activity in a visual-only task had no effect 
on performance regardless of task difficulty (Fig. 2b, n=3 mice, 
=246 trials per condition). 

We reasoned that the PFC might be exerting its effect on perform- 
ance by biasing sensory circuits towards target stimulus processing and 
distractor suppression. Several studies have suggested that this top- 
down bias is exerted at the level of the sensory cortex*’’. We did not 
find this to be the case in our task; perturbing visual cortical activity 
diminished visual performance only during stimulus presentation 
(Fig. 2c). This effect, unlike that of the PFC, was not cross-modal task 
specific, as it increased the detection threshold in a visual-only task 
(Fig. 2d, n =3 mice, =239 trials per condition, P< 0.01, bootstrap 
comparison). An analogous manipulation of auditory cortex resulted 
in a qualitatively similar effect on performance, but with a larger effect 
size (Extended Data Fig. 4a, n = 4 mice, **P < 0.01, Wilcoxon rank- 
sum test), probably owing to the auditory cortical requirement for 
stimulus discrimination’. Together, these findings support the role 
of sensory cortical areas in stimulus amplification and discrimination, 
but are inconsistent with their being a locus for top-down bias of 
sensory processing. Moreover, optogenetic perturbations of frontal 
regions that project directly to visual and auditory cortices such as 
the anterior cingulate cortex (ACC) and the lateral orbitofrontal cortex 
(OFC) did not affect cross-modal performance (Extended Data Fig. 4b, 
c, n = 8 sessions, 2 mice). In contrast, localized viral injection of AAV- 
hSyn-DIO-ChR2 into the prelimbic cortex of mice in which Cre 
recombinase is expressed in inhibitory neurons (VGAT-Cre), followed 
by optical manipulation, disrupted performance (Extended Data 
Fig. 4d-h). 

Having shown that direct prefrontal-sensory cortical interaction is 
unlikely to account for top-down control of visual processing 
(Fig. 2e), we sought to find the locus of attentional modulation 
observed in the cross-modal task (Fig. 1c). The sensory thalamus 
has been implicated in attentional modulation in primates’*”, raising 
the possibility that it could be a locus of top-down sensory bias. Using 
a similar VGAT-ChR2 strategy, we found that optogenetic perturba- 
tion of the visual thalamus (lateral geniculate nucleus; LGN) during 
either stimulus anticipation or presentation impaired cross-modal 
performance (Fig. 2f, n = 12 sessions, 3 mice). This suggested that 
top-down bias of visual detection may be based in the thalamus 
(Fig. 2g). Visual thalamic manipulation resulted in worsening per- 
formance on both auditory and visual trials, implicating intrathalamic 
interactions in this behavioural effect. Intrathalamic interactions are 
mediated by a shell of GABAergic neurons surrounding thalamic 
nuclei known as the thalamic reticular nucleus (TRN)’®. TRN neu- 
rons have been hypothesized to control the gain of thalamic output 
in a behaviourally relevant manner’’”®’. As such, we asked whether 
the TRN could be a locus of top-down modulation of the sensory 
thalamus. 

To investigate the role of the TRN in visual gain control in divided 
attention, we used an intersectional genetic/connectivity strategy to 
label inhibitory (VGAT-positive) neurons that project to the LGN with 
retrograde lentiviruses” (Fig. 3a and Extended Data Fig. 5a, b) and 
optogenetically tagged them during extracellular recordings in freely 
behaving mice (Extended Data Fig. 5c-f). 

During cross-modal performance, we observed bidirectional modu- 
lation of visTRN neurons consistent with their hypothesized role in 
behaviourally relevant sensory gain control. Specifically, during 
‘attend to vision’ trials, firing rates of these neurons were reduced. In 
contrast, their rates were elevated during ‘attend to audition’ trials 
(Fig. 3b (example), Fig. 3c (population; n = 138 cells) and Extended 
Data Fig. 6). Attentional modulation by the visTRN was eliminated by 
optogenetic PFC disruption (Fig. 3d, e, n = 56 cells). Although PFC 
disruption diminished performance, the effects on visTRN firing rates 
were not simply a covariant of inaccurate performance (Extended Data 
Fig. 7). In contrast to naturally occurring error trials, during which 
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Figure 3 | PFC-dependent visTRN modulation suggests PFC-TRN 
functional coupling is required for visual gain control. a, Intersectional 
tagging of visTRN neurons based on connectivity and genetic identity. Inset, 
maximum projection of ten 1-jum confocal images. Cells were labelled with 
ChR2-eYFP and stained with anti-GFP. eYFP, enhanced yellow fluorescent 
protein; GFP, green fluorescent protein. b, Raster of two visTRN neurons 
triggered on task initiation. Note the reduction of firing rate between the first 
trigger (trial initiation) and the second one (stimulus presentation) during the 
‘attend to vision’ condition, but the opposite during the ‘attend to audition’ 
condition. Red dotted lines indicate triggers. Fading grey boxes denote the jitter 
of the anticipatory period. c, Group analysis of b, showing a scatter plot of 
responses from visTRN neurons (n = 138 cells from 4 animals, P< 0.005, 
Wilcoxon rank-sum test performed over all cells). The orange crosshair 
indicates mean + 95% confidence interval. d, PFC activity was disrupted 
during stimulus anticipation to examine the effect on visTRN activity. e, PFC 
disruption diminished visTRN modulation of attention. f, g, Behavioural 
performance is causally dependent on visTRN modulation of attention. 

f, Optogenetic activation of retrograde-tagged visTRN neurons resulted in 
performance diminishing on visual trials (mean + s.e.m., n = 12 sessions from 
3 mice, *P < 0.05, **P < 0.01, ***P < 0.001, Wilcoxon rank-sum test), 
consistent with this manipulation lowering visual gain. g, In contrast, 
optogenetic inhibition of visTRN neurons preferentially diminished 
performance on auditory trials, consistent with inappropriate visual gain 
increase (n = 12 sessions from 3 mice). 


diminished visTRN firing rate modulation was observed in both 
‘attend to vision’ and ‘attend to audition’ trials, PFC disruption had 
a greater effect on ‘attend to audition’ trials. This result may highlight a 
role of the PFC in distractor suppression. 

To examine whether the physiological effects observed in the 
visTRN were causal to behaviour, we manipulated this subnetwork 
bidirectionally. Although activating the visTRN resulted in similar 
effects to driving LGN inhibition in VGAT-ChR2 mice (Fig. 3f, 
n = 12 sessions, 3 mice), the effect size was smaller, probably reflecting 
the weaker nature of the genetic manipulation (Extended Data Fig. 8). 
This result supports the notion that elevated visTRN firing reduces 
visual thalamic gain. In contrast, inhibiting visTRN function using the 
light-activated Cl! pump eNpHR3.0 diminished performance on 
‘attend to audition’ trials, suggesting it inappropriately enhanced 
visual thalamic gain when it needed to be suppressed (Fig. 3g, 
n = 12 sessions, 3 mice). 
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Figure 4 | Direct evidence for visual thalamic gain modulation in divided 
attention. a, Cartoon depiction of multi-electrode targeting of the LGN at 
2.06 mm posterior to Bregma in freely behaving mice. b, An example of 
differential modulation of a single LGN cell spiking under the two anticipatory 
task conditions. Note that contralateral eye stimulation (with respect to 
recording electrodes) resulted in more robust visual drive. Moreover, the cell 
discharged more spikes during anticipation and presentation when attention 
was directed towards vision. Contra, contralateral; Ipsi, ipsilateral. c, d, Group 
analysis of the phenomenon in b (n = 161 cells from 4 mice, Wilcoxon rank- 
sum test). e, Similarly, enhanced visual responses were observed at the level of 
visual evoked potentials. Top left, example visual evoked potential (VEP); 
bottom left, cumulative distribution of VEP amplitudes, showing higher values 
for ‘attend to vision’ trials (P < 0.01, Kolmogorov—-Smirnov test). Right, average 
VEP from 4 mice (684 visual trials and 633 auditory trials from 29 sessions; 
dashed line indicates baseline potential). Shaded errors are 95% confidence 
intervals. Orange blocks denote visual stimulus presentation. 


To determine whether modulation of visTRN firing rate affected 
visual processing, we investigated LGN spiking in response to 
well-controlled visual stimuli' We implanted mice with multi- 
electrode arrays targeted to the LGN (Fig. 4a). To minimize trial-to- 
trial variability related to slight changes in head and eye position, we 
changed the position of visual stimuli from wall-mounted to head- 
mounted LEDs. LGN neurons showed enhanced baseline and evoked 
activity when attention was directed to vision (Fig. 4b (example), 
Fig. 4c, d, (population, n = 161 cells, 4 mice)), consistent with baseline 
and evoked neuroimaging results observed in human LGN”. 
Differences in evoked responses were also observed in the visual 
evoked potential (Fig. 4e, n = 684 visual trials and n = 633 auditory 
trials, 4 mice). These physiological effects were not observed during 
error trials (Extended Data Fig. 9). 

Suppression of LGN spiking in ‘attend to audition’ trials could be a 
result of direct feedforward inhibition or reduction in feedforward 
excitation. In contrast to many extrareticular inhibitory inputs”, the 
visTRN is known to exert direct feedforward inhibition on LGN neu- 
rons* (Fig. 5a). As such, we sought to measure LGN inhibition 
directly. To do so, we leveraged a recently developed technique known 
as fibre photometry, which has been used to measure bulk changes of 
intracellular Ca** concentration ([Ca?*];) (ref. 24). We modified two 
aspects of conventional photometry to allow interrogation of intra- 
cellular Cl’ concentration ([CI ];), a proxy for synaptic inhibition and 
an otherwise inaccessible measure. First was the genetic labelling of 
neurons with the Cl” indicator SuperClomeleon’’. This fluorescence 
resonance energy transfer (FRET) indicator contains cyan fluorescent 
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Figure 5 | Measuring bulk intracellular [CI’] in vivo shows dynamic changes 
in LGN inhibition during behaviour. a, Possible mechanisms for modulation 
of LGN firing rate: extrareticular inputs can change activity by presynaptic 
inhibition of feedforward excitation, whereas visTRN inhibits LGN directly. 
b, FRET photometry setup and schematic of CFP-to- YFP FRET. c, Cloning of 
SuperClomeleon, a FRET-based Cl indicator, into an adeno-associated virus 
(AAV) followed by in vivo expression in the LGN. hSyn, human synapsin 
promoter; ITR, inverted terminal repeat sequences; pA, polyadenylation signal; 
WPRE, woodchuck hepatitis virus post-transcriptional regulatory element. 

d, Pharmacological confirmation of the technical feasibility of SuperClomeleon 
FRET for GABA,-mediated increase in intracellular [Cl'] by injection of the 
GABA, agonist THIP. Note that the YFP control mice did not show similar 
signals (n = 3 mice per condition; shaded errors are 95% confidence intervals). 
e, Mice showed stronger visual-evoked SuperClomeleon FRET responses 

for the contralateral eye, as would be predicted (n = 3 mice, P< 0.05, Wilcoxon 
rank-sum test). Yellow bars mark the display of the light stimuli. f, Left, cartoon 
depiction of photometry in the cross-modal task, during which the visual 
stimulus was signalled through a head-mounted LED as in Fig. 4 (see 
Supplementary Video 1 for illustration); middle, differential visual-evoked 
[CI]; LGN responses in relation to the modality anticipated (363 correct visual 
trials and 274 correct auditory trials from 6 mice). Shaded errors are 95% 
confidence intervals. Note that ‘attend to audition’ trials showed an earlier 
increase in [CI ]; (decreased SuperClomeleon FRET) and the separation 
between the two traces started before stimulus onset, consistent with 
differential anticipatory changes of visTRN activity; right, optogenetic TRN 
inactivation eliminates this differential response (101 correct visual and 82 
correct auditory trials from 3 mice). Orange bars indicate visual stimulus 
presentation and the yellow bar denotes optogenetic TRN inactivation. 


protein (CFP) as FRET donor and yellow fluorescent protein (YFP) as 
FRET acceptor. Under conditions of elevated [Cl’];, YFP is quenched 
and FRET is reduced (Fig. 5b). Second was the light path; we excited 
SuperClomeleon with 430-nm light and collected both CFP and YFP 
emission data to perform subsequent ratiometric measurements off- 
line (Fig. 5b). To use this technology in vivo, we generated a viral 
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construct harbouring SuperClomeleon 
SuperClomeleon) and injected it into the LGN (Fig. 5c). 

We validated SuperClomeleon FRET as a measure of inhibition 
by two methods. First, we reasoned that pharmacological activation 
of GABA, receptors would increase [Cl]; and reduce YFP emission. 
Indeed, intraperitoneal injection of the GABA, agonist 4,5,6,7- 
tetrahydroisoxazolo(5,4-c)pyridin-3-ol (THIP, 8mgkg™') reduced 
SuperClomeleon FRET in a pharmacokinetically predicted manner”® 
(Fig. 5d, n = 3 mice). Second, we expected visual drive to result in 
inhibition of LGN neurons that balanced excitation. This was 
indeed the case; these signals were larger when visual stimuli were 
delivered to the side contralateral to the recorded LGN than when 
delivered to the ipsilateral side (Fig. 5e, n =3 mice). These rapid 
events were not observed in YFP control mice (Extended Data 
Fig. 10a). SuperClomeleon visual transients were sensitive to the 
GABA, receptor antagonist flumazenil in a dose-dependent manner 
(Extended Data Fig. 10b), confirming that these signals reflect 
GABAergic inhibition. 

Having validated chloride photometry, we asked whether changes 
in visual gain associated with sensory selection were explained by 
opposing changes in LGN feedforward inhibition. We found this to 
be the case; visual-evoked chloride photometry showed significantly 
larger responses in ‘attend to audition’ trials than in ‘attend to vision’ 
trials (Fig. 5f, n = 363 visual trials and n = 274 auditory trials, 6 mice; 
example trials shown in Supplementary Video 1). Signal kinetics in 
these two conditions were distinct: an earlier reduction in FRET ratio 
was seen in auditory selection trials compared to visual selection trials, 
consistent with the differential baseline spiking observed in visTRN 
neurons. Optogenetic vis RN inactivation eliminated this differential 
inhibitory response (Fig. 5f and Extended Data Fig. 10c, n = 3 mice, 
>82 trials). Overall, our data support the model that thalamic gain 
control can be explained by feedforward inhibition and that the TRN is 
the source of this inhibition. More generally, to our knowledge, this 
experiment constitutes the first measurement of inhibitory dynamics 
in freely behaving animals. 

Seminal studies have shown the thalamus to be more than a cortical 
relay”’*. By providing a mechanistic circuit dissection of thalamic 
involvement in divided attention, we extend these studies in two direc- 
tions. First, our findings in mice show the generality of thalamic modu- 
lation of attention across mammalian brains. Second, we provide a first 
description, with causal circuit dependence, of how prefrontal top- 
down control changes thalamic inhibitory dynamics to modulate 
sensory gain. The specific involvement of prelimbic cortex in this 
behaviour, which we further demonstrate through combined optoge- 
netics and chloride photometry (Extended Data Fig. 10d), does not 
eliminate the possibility that the OFC and the ACC may be engaged in 
other types of top-down control, potentially via cortico-cortical inter- 
actions”. In addition to regulating sensory gain, prelimbic control of 
thalamic inhibition may regulate the degree by which relay nuclei 
participate in large-scale functional interactions”. 

The ability to directly measure [Cl]; dynamics provided access to a 
critical biological variable: GABA,-mediated synaptic inhibition. 
Although photometry has already been introduced into neuroscience 
for measurement of [Ca ]; in cell bodies and terminals”®, FRET-based 
chloride photometry has not been performed previously. In this study, 
developing chloride photometry was essential for establishing a direct 
physiological link between visTRN and LGN spiking (technical dis- 
cussion in Supplementary Information). 

Thirty years ago, Francis Crick proposed that the TRN functions as a 
‘searchlight’, directing the internal spotlight of attention to thalamo- 
cortical circuits that process ongoing behavioural demands’. Owing 
to technical limitations, this transformative model has been difficult to 
test, particularly under conditions where the attentional spotlight shifts. 
Our study combined novel and established technology to provide 
mechanistic details for Crick’s ‘searchlight hypothesis’, thereby contrib- 
uting to understanding the circuit mechanisms of sensory selection. 
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METHODS 

Animals. VGAT-ChR2 mice were purchased from the Jackson Laboratory and 
maintained on a C57BI6/J background. VGAT-Cre mice were backcrossed to 
C57B16/J mice for at least six generations. For experiments in Fig. 1, a total of 
fifteen animals were trained, ten of which were later used to establish psychometric 
functions (four for divided attention; six for reversal learning). For Fig. 2, four 
VGAT-ChR2 mice were used for disruption of PFC and primary sensory cortices 
and three mice were used for inactivating LGN. In Fig. 3, four VGAT-Cre mice 
were used for electrophysiological recordings from optogenetically identified 
visTRN neurons, of which two were used for combined electrophysiological 
recordings with optogenetic PFC inactivation (Fig. 3c-e). An additional six mice 
were used for optogenetic activation or inhibition of visTRN (three per manip- 
ulation) during behaviour (Fig. 3f-g). Four wild-type mice were used for LGN 
recordings (Fig. 4). For fibre photometry experiments (Fig. 5), six mice were 
injected with AAV-hSyn-SuperClomeleon for behavioural and pharmacological 
experiments and three YFP control mice were used. Including all animals used for 
Extended Data figures, a total of 28 male mice, 1.5-6 months old, were trained on 
the cross-modal task. All experimental procedures involving animals were per- 
formed according to the guidelines of the Institutional Animal Care and Use 
Committee at the New York University Langone Medical Center and the US 
National Institutes of Health. 

Behavioural training and testing. Behavioural setup. Experiments were con- 
ducted in a custom-built trapezoidal testing chamber (base 1, 12 cm; base 2, 
25 cm; height, 25 cm) positioned over a grid floor. The testing chamber contained 
three nose-pokes, each of which consisted of an infrared LED/infrared phototran- 
sistor pair (Digikey, Thief River Falls, Minnesota) for response detection. 
Activation of a central nose-poke located on the grid floor, 6cm away from the 
reward wall, was required for trial initiation. Two headphone speakers 
(Skullcandy, Park City, Utah) embedded in the floor delivered biasing cues binau- 
rally. Two white LEDs (Mouser, El Cajon, California) were mounted 6.5 cm apart 
on the base wall below two additional nose-pokes. Liquid reward consisting of 
10 tl evaporated milk was delivered directly to these wall-mounted nose-pokes via 
a single-syringe pump (New Era Pump Systems, Farmingdale, New York). Access 
to these response nose-pokes was restricted by a rotating, servo-controlled (Tower 
Hobbies, Champaign, Illinois) disc (radius, 7 cm). Rewards could be accessed from 
these nose-pokes only when two holes in the rotating disc were aligned with the 
underlying nose-pokes. Trial logic was controlled by custom software running on 
an Arduino Leonardo microcontroller (Ivrea, Italy). 

Training. Mice were food restricted to 85-90% of their ad libitum body weight 
before training. Training consisted of multiple levels. First, mice were habituated 
to the test box and allowed to collect reward freely. Reward availability was sig- 
nalled by the rotation of the aforementioned wall-mounted disc. The location of 
reward (left or right poke) was indicated by either a visual or an auditory stimulus. 

For ‘attend to vision’ (visual) trials, the rewarded response poke was indicated 
by illumination of the LED mounted underneath it. In ‘attend to audition’ (aud- 
itory) trials, an upsweep (10-14 kHz, 500 ms) indicated a reward on the left and a 
downsweep (16-12 kHz, 500 ms) indicated a reward on the right. To facilitate 
discrimination learning, sweeps were initially presented in a directional manner. 

Trials were given in single-modality blocks of six, with alternating block type 
(that is, six visual trials followed by six auditory trials; Extended Data Fig. 1, top 
row). The stimulus was presented until the animal collected the reward. An indi- 
vidual trial was terminated 20s after reward collection and a new trial became 
available 5s later. Second, mice learnt to poke to receive a reward. All other 
parameters remained constant. An incorrect poke had no negative consequence. 
By the end of this training stage, all mice collected at least 20 rewards per 
30-minute session. 

Third, mice were trained to initiate individual trials, allowing for the establish- 
ment of a temporal window in which they could anticipate subsequent delivery of 
the stimuli. For successful initiation, mice had to break the infrared beam briefly 
(50 ms) in the initiation poke to trigger stimulus presentation and rotation of the 
wall-mounted disc. Mice were informed about trial availability and modality type 
by brown noise (10-kHz low-pass-filtered white noise, visual trial) or blue noise 
(11-kHz high-pass-filtered white noise, auditory trial) delivered binaurally. At this 
stage, modality types were arranged in a non-conflicting block design (Extended 
Data Fig. 1, top row). Correct poking resulted in reward delivery, whereas incorrect 
poking resulted in immediate termination of the trial by disc rotation, blocking 
access to reward. Rewards were available for 15s following correct poking, fol- 
lowed by a 5-s intertrial interval (ITI). Incorrect poking was punished with a time- 
out, which consisted of a 30-s ITI. Mice could not initiate new trials during an ITI. 
To avoid development of side preferences, the target stimulus would appear at the 
same location as it did on the previous trial following an incorrect response. After 
one week of training on this stage, mice successfully associated the target stimuli 


with the appropriate reward location (Extended Data Fig. 1, top row). At this stage, 
directionality of sound stimuli did not affect performance. 

Fourth, mice had to resolve sensory conflict. Auditory and visual target stimuli 
were always presented in a conflicting manner (Extended Data Fig. 1, middle row). 
The brown and blue noise cues indicated the modality to be selected. During 
a session, four different trial types were presented in blocks in repeating order: 
(1) three auditory trials, (2) three visual trials, (3) six conflict trials with an auditory 
target and (4) six conflict trials with a visual target. To prevent modality prefer- 
ences, an incorrect response resulted in the repetition of the same trial type, 
thereby specifically increasing the block length of the trial types with weak per- 
formance. This training stage was introduced to teach mice to attend only to the 
target modality during a conflict trial. Over the course of this training stage 
(1 week), the duration of the target stimuli was successively shortened to 3s, 1s 
and 0.5s. In parallel, the time that mice had to break the infrared barrier in the 
initiation nose-poke was continually increased and randomized to a final range of 
0.5-0.7 s, rendering the precise presentation time of target stimuli unpredictable. 
Once mice performed successfully on conflict trials (Extended Data Fig. 1, middle 
row) the single-modality trials were removed and block length was reduced to 
three trials. This change in the training paradigm was made to facilitate learning of 
the trial-type cueing (brown and blue noise). 

On the fifth and final stage of training, all block structure was removed and trial 

type was randomized (Extended Data Fig. 1, bottom row). We used three measures 
to ensure that mice followed the trial-type cueing and did not employ simple 
alternating strategies. In addition to computing overall accuracy (Extended Data 
Fig. 1, graphs on left), we quantified the number of consecutive correct trials 
(Extended Data Fig. 1, middle column) and calculated the fraction of correct 
modality switches (Extended Data Fig. 1, graphs on right). At this final stage, 
rewards were available for only 5s. 
Psychophysics. For experiments determining the visual detection psychometric 
function, the ratio between visual and auditory trials was adjusted from the typical 
1/1 to 4/1 to facilitate the acquisition of a larger number of visual trials while 
maintaining the divided-attention nature of the task. In addition, visual stimulus 
duration was shortened to 0.1 s and the light was randomly displayed at one of five 
different intensities (0.15, 0.3, 0.6, 0.9, 1.2 Im). To establish the comparison 
between psychometric functions of visual-only and divided-attention trials 
(Fig. 1c), we trained mice that reached criterion (>70% accuracy) on the cross- 
modal task to perform a visual-only task. For one week, mice were trained on a 
visual-only task every other day; trials containing only visual target stimuli were 
cued by broadband white noise. Subsequently, visual-only trials were introduced 
into the cross-modal task at a 1/4 ratio and in a random interleaved manner. Mice 
were found to differentially anticipate visual-only and visual target with auditory 
conflict trials (Fig. 1c), whereas they continued to perform equally well on conflict 
trials with an auditory target (Extended Data Fig. 3). 

To separate the effect of anticipating a conflicting stimulus (top-down) from the 

presence or absence ofa distracting stimulus itself (bottom-up), we performed two 
experiments. In the first experiment, mice performed the cross-modal task with 
70% conflict trials and 30% in which conflict was expected but auditory distraction 
was removed (Fig. 1d). In the second experiment, mice that had been trained on 
the cross-modal task had the biasing cues replaced with broadband white noise 
and the modality rewarded was changed on a session-by-session basis such that 
mice would deduce it based only on reward history (Fig. le). 
Behavioural analysis and determination of visual detection threshold. 
Performance in behavioural tests was assessed based on the fraction of correct 
responses relative to chance level or guess rate (50%, 1). The visual detection 
threshold («) and maximum performance (A) were estimated by fitting perform- 
ance across stimulation intensities with a logistic function”: 


(=7-A) 
1+exp(— B(x—«)) 


where x corresponds to the five stimulus levels expressed as a percentage of 
maximum stimulus intensity. The fraction of correct trials as a percentage of all 
trials was summed across sessions and the overall performance as a function of 
stimulus intensity was fit using maximum likelihood estimation*® implemented in 
the Palamedes psychophysical toolbox (http://www.palamedestoolbox.org/). 
Estimation of the distribution of the « parameter was made via non-parametric 
bootstrap analysis of curve fits (Fig. 1). To adjust for variable lapse rates (Extended 
Data Fig. 2), the fraction of correct trials was normalized so that the minimum and 
maximum performance rates corresponded to 50% and 100%, respectively”. 
Curve fitting and estimation of the « parameter then proceeded as described above. 
Model selection for the number of psychophysical parameters was based on the 
Akaike information criterion”. 

Optogenetics in behaviour. For experiments with optical stimulation (Fig. 2), 
testing conditions were equivalent to the final stage of training. Laser trains of 


F(x; 2, B,A,y)=y+ 
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either blue (for ChR2 activation) or yellow (for eNpHR3.0 activation) light con- 
sisting of 50-Hz 18-ms pulses (90% duty cycle) at an intensity of 5-6 mW (mea- 
sured at the tip of the optic fibres) were delivered on every other trial. On laser 
trials, stimulation occurred either during the anticipatory period (0.5-0.7 s) or 
during stimulus presentation (0.5). Because behaviour and recording systems 
were automated and stimulus sequence and optogenetic manipulations varied ona 
trial-by-trial basis, researchers were not blinded to the conditions. In the case of 
multiple sequential pharmacological or optogenetic manipulations in the same 
animals, tests were performed in a predefined, pseudorandom order. For compar- 
isons of multiple groups, Kruskal-Wallis one-way analysis of variance was used to 
assess variance across groups before pairwise comparisons. Power analysis based 
on effect size estimates was used to determine sample size required for statistical 
significance with a power of / = 0.7; more than three samples were required to 
detect significant differences. 

Electrophysiology and optical chloride measurements in behaviour. For com- 
bined TRN recordings with optogenetic PFC disruption (Fig. 3d, e), laser trains of 
blue light (as described earlier) were delivered during the anticipatory period on 
every other trial. For electrophysiological recordings of LGN units and fibre pho- 
tometry measurements, visual stimuli were presented through illumination of 
diffusion-coated wide-angle 3-mm flat-top LED lights (LightHouseLEDS, 
Washington), fixed directly on the head of the mouse and centred 8mm from 
the eyes. The LEDs mounted on the base wall of the behavioural box were turned 
off in this condition. These changes allowed emitted light to activate ~150° of the 
visual angle when the eye is centred at rest”. 

Viruses. For retrograde optogenetic tagging and TRN manipulation, FuGB2- 
pseudotyped retrograde lentiviruses (RG-LV) were used as described previously”. 
visTRN neurons were labelled through injection (0.4-0.6 ul) of RG-LV- 
EFla-DIO-ChR2-GFP (for activation) or RG-LV-EFla-DIO-eNpHR3.0-eYFP 
(for inactivation) into the primary visual thalamus (anterior—posterior (A-P), 
—2.1mm; medial-lateral (M-L), +2 mm; dorsal-ventral (D-V), 2.5 mm) using 
a quintessential stereotactic injector (QSI, Stoelting, Wood Dale, Illinois). 
Coordinates are referenced to Bregma. For combined electrophysiological record- 
ings with optogenetic PFC disruption during behaviour, 0.4 pl AAV2-hSyn-DIO- 
ChR2-GEP (titre, 10’? vector core per ml) was injected into the PFC (A-P, 2.6 mm; 
M-L, £0.25 mm; D-V, -1.25mm). To measure chloride flux in the LGN, the 
transgene SuperClomeleon (gift from G. J. Augustine) was cloned into the 
Agel and EcoRI restriction sites of an AAV-hSyn-SSFO-eYFP plasmid to obtain 
AAV-hSyn-SuperClomeleon. The SuperClomeleon recombinant AAV was pack- 
aged as serotype 2 (University of North Carolina, Vector core facility; titre, 10'” 
vector core per ml) and 0.6-0.7 ul virus was injected into the visual thalamus. 
Following injections, mice were allowed to recover for 2-4 weeks to allow for 
virus expression. 

Optic fibre implant experiments. Mice were anaesthetized using 1% isoflurane 
and mounted on a stereotactic frame. For cortical inactivation experiments, up to 
three pairs of 4-5-mm-long optic fibres (Doric Lenses, Quebec, Canada) were 
inserted bilaterally to target up to three different brain areas per mouse (prelimbic 
cortex, 2.6mm A-P, +0.25mm M-L, -1.25mm D-V; primary visual cortex, 
-3.5mm A-P, +2.50mm M-L, -0.50mm D-V; primary auditory cortex, 
-2.8mm A-P, +4.00 mm M-L, -2.00 mm D-V; AAC, 0.5mm A-P, +0.25mm 
M-L, -1.00 mm D-V; lateral OFC, 2.6 mm A-P, +1.50 mm M-L, -2.00mm D-V; 
primary visual thalamus, -2.1mm A-P, +2.00mm M-L, -2.50mm D-V; 
visTRN, -1.6 mm A-P, +2.20mm M-L, -3.00 mm D-V). Two or three stainless 
steel screws were implanted into the skull to anchor the implant and were fixed 
with dental cement. Animals were allowed to recover and training resumed one 
week later. For ChR2 activation a 473-nm laser was used and a 561-nm laser was 
used for eNpHR3.0 activation (Omicron-Laserage, Dudenhofen, Germany). 
Drive construction. Custom drive housings were designed using 3D CAD soft- 
ware (SolidWorks, Concord, Massachusetts) and printed in Accura 55 plastic 
(American Precision Prototyping, Tulsa, Oklahoma) as described previously”. 
Prior to implantation, each drive was loaded with 8-12 independently movable 
microdrives carrying up to 3 nichrome (12.5 jtm) and/or tungsten (25 1m) stereo- 
trodes (California Fine Wire Company, Grove Beach, California). Stereotrodes 
were pinned to custom-designed 32- or 64-channel electrode interface boards 
(EIB; Sunstone Circuits, Mulino, Oregon) along with a common reference wire 
(A-M systems, Carlsborg, Washington). For optogenetic tagging, an optical fibre 
was embedded adjacent to the stereotrode array. In these cases, the optic fibre 
extended 3.5mm from the base of the drive so that it could be stereotactically 
positioned above the TRN during implantation. Targeting of the TRN or LGN was 
achieved by guiding stereotrodes and optic fibres through a square array of poly- 
imide sleeves attached to the base of the drive body. 

Drive and fibre implantation for optical activity measurements. Prior to sur- 
gical implantation, mice were anaesthetized with 1% isoflurane and placed in a 
stereotactic frame. Stainless steel screws were implanted into the skull to provide 


LETTER 


electrical and mechanical stability for the drives. For drive implantations, cranio- 
tomies (~3 mm X 2 mm) were drilled, centred at -2 mm A-P and 2.5mm M-L 
for TRN recordings (15° angled implantation) and at -2.3 mm A-P and -2.5mm 
M-L for LGN recordings. The dura mater was carefully removed and drives were 
centred at the craniotomy coordinates using a custom stereotactic arm. Drive 
bodies were slowly lowered into the craniotomy until stereotrode tips were 
~500 «tm below the cortical surface and optical fibres were positioned just above 
the TRN (2.5mm D-V). For fibre-photometry-based optical recording, low- 
internal-fluorescence optic fibres (400 mm diameter) (Doric Lenses, Canada) were 
implanted just dorsal to the LGN (-2.2 mm A-P, 2.15 mm M-L and 2.6 mm D-V) 
following virus injection. 

Electrophysiological recordings. After mice had recovered from implantation 
surgery, recordings were made using a Neuralynx multiplexing digital recording 
system (Neuralynx, Bozeman, Montana). Signals were acquired using a 32- or 64- 
channel digital headstage connected to the implanted EIB. Signals from each 
electrode were amplified, filtered between 0.1 Hz and 9kHz and digitized at 
30 kHz. Local field potential signals were obtained from a single wire per stereo- 
trode. Following implantation, stereotrode sets were incrementally lowered from 
the cortex into the target thalamic structure over the course of 1-2 weeks 
(Extended Data Fig. 5). Spike sorting was performed offline following acquisition 
based on relative spike amplitude and energy within electrode pairs using the 
MClust toolbox (http://redishlab.neuroscience.umn.edu/mclust/MClust.html). 
Following manual clustering, cross-correlation and autocorrelation analyses were 
used to confirm adequate separation. 

Optogenetically tagged visTRN units were identified based on ChR2-mediated 
response to stimulation using a 473-nm analogue-modulated laser (Omicron- 
Laserage, Dudenhofen, Germany)”. Laser light was delivered by a 200-11m optic 
fibre targeted to the TRN (Extended Data Fig. 5) connected to a fibre optic patch 
cord (200-j1m core, Doric Lenses, Quebec, Canada). The laser intensity was set at 
~8 mW optical output power measured at the patch cord terminus. Fibres were 
polished before implantation so that the power at the tip was =50% maximum, 
resulting in ~4—5-mW laser light being delivered to the brain. Only neurons that 
showed clear transient responses to laser stimulation were included in the analysis. 
Analysis of firing rate. Changes in firing rate during task performance were 
assessed for 138 identified visTRN neurons recorded from four animals and 119 
LGN neurons in two animals. Peri-event time histograms aligned to trial initiation 
and to stimulus presentation were computed using a 5-ms bin width for individual 
neurons in each recording session*. Separate histograms were created for correct 
and incorrect trials within auditory and visual target stimuli and convolved with a 
Gaussian kernel (8 ms half-width at half-height) to create a spike density func- 
tion®°*. The average firing rate across trials was determined during the anticipa- 
tion window before stimulus presentation. The evoked response amplitude was 
estimated by averaging the firing rate within a 100-ms window starting 20 ms after 
stimulus onset. Window duration was chosen based on the latency-to-peak res- 
ponse for point stimuli in the mouse LGN”. 

For normalized rate changes in TRN neurons, firing rates during the attentional 
window in each trial were compared with the baseline firing rate (5-s window, 0.5 s 
before task initiation). Statistical comparison of firing rate changes was used to 
identify neurons with significant task-associated changes in firing rate via non- 
parametric comparison of firing rate during the attentional window and the base- 
line period”*. The test statistic (W) was calculated based on ranking of all trials (N) 
and comparison was performed using the sign function (sgn): 


N 


W=|9[sen(x2j—m1j) Ri] 


ay 


where x, and x, were the attentional window and baseline firing rate, respectively, 
and R denotes the rank. The threshold for significance was set at 0.05 and signifi- 
cantly modulated units were defined as neurons in which the test statistic was less 
than the critical value for the sample size (Waipha(o.os)). Comparison of firing rates 
across trial types (for example, visual versus auditory correct) was performed using 
the Wilcoxon rank-sum test. Homogeneity of variance for firing rates across 
conditions was determined using the Fligner-Killeen test of homoscedasticity. 
Analysis of visual evoked potentials. Visual evoked potentials (VEPs) were 
computed from the broadband LGN local field potential (LFP; 0.1 Hz-10 kHz). 
The particular stereotrode used for VEP analysis in behaviour was selected based 
on the amplitude of responses in post-task recordings during which there were 
many more trials included. Task-related VEPs were averaged during correct aud- 
itory and visual trials across recording sessions. To determine peak response, 
the lowest negative-potential offsets associated with the visual response” 
(0-250-ms window) were identified on a trial-by-trial basis. Signals from indi- 
vidual trials were smoothed with a 25-ms half-width filter over the response 
window before obtaining the peak offset*®. 
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Histology. Mice were euthanized and transcardially perfused with PBS followed 
by 4% paraformaldehyde. Brains were dissected, post-fixed overnight at 4°C and 
sectioned using a vibratome (LEICA, Buffalo Grove, Illinois). 

For GFP enhancement, immunofluorescent staining was carried out on 
50-um-thick sections using chicken anti-GFP (1/1,000, GFP-1020, Aves). 
Sections were incubated overnight with primary antibody in PBS-T (10% normal 
goat serum and 0.05% Tween20) at 4°C. Detection of primary antibodies was 
carried out with Alexa-Fluor-conjugated secondary antibodies (1/1,000, A-11039, 
Invitrogen). All sections were imaged on a Zeiss LSM510 META confocal 
microscope. 

Fibre-photometry-based optical chloride measurements. FRET-based measure- 
ment of chloride was performed during behaviour using a custom-designed fibre 
photometry system™. A fibre-coupled LED (Thorlabs, Newton, New Jersey) light 
source, filtered using a 434-nm clean-up filter (MF434-17 Thorlabs, Newton, New 
Jersey), was used for CFP excitation. Excitation light was split via a long-pass 
dichroic mirror (DMLP425, Thorlabs, Newton, New Jersey) and coupled to a 
400-j1m, 0.48-NA (pharmacology) optic patch cord (Doric lenses, Canada) linked 
to a 400-m chronically implanted optical fibre. Excitation and emission light were 
conveyed by a single patch cord linking the fibre photometry system to the 
implanted fibre. SuperClomeleon CFP and YFP emissions”’ were separated using 
a single-edge beam splitter (FF511-Di01, Semrock, Rochester, New York). Each 
emission wavelength was independently focused onto a separate femtowatt silicon 
photoreceiver (Newport, Irvine, California) using custom optics (12.7-mm focal 
length plano-convex lens mounted in Thorlabs SM1NR05 lens tube). The light 
signal was digitized and recorded using a TDT signal acquisition system (Tucker- 
Davis Technologies, Alachua, Florida). Signal bandwidth was limited to <750 Hz 
based on the photoreceiver response characteristics. The fluorescence ratio was 
calculated across the recording period. To minimize the effect of slow fluctuations, 
normalized delta fluorescence (df/F) was calculated for evoked responses relative to 
the baseline fluorescence level before each event (1-s window). Traces were 
smoothed with a convolution filter (50 ms half-width). Peak response (Extended 
Data Fig. 10) was estimated as the minimum over a 500-ms window following 
stimulus onset. For pharmacological activation of GABA, receptors with 4,5,6,7- 
tetrahydroisoxazolo(5,4-c)pyridin-3-ol (THIP), baseline fluorescence was estimated 
over 5 minutes before injection. For visual stimulation, light pulses of 100-ms 
duration were displayed to the ipsi- or contralateral side of the recorded LGN. 


Effects of the GABA, receptor antagonist flumazenil on visual evoked responses 
were quantified by comparing the average peak response from 
5 minutes before injection (baseline) to one within a 5-minute time window around 
the maximal response suppression (maximal drug effect) and at the end of the 
recording session (recovery, at least 100 min after injection). For optogenetic manip- 
ulations of frontal cortical structures, smaller-diameter patch cords (200 um, 0.37 
NA) were used to allow movement and prevent tangling. For these recordings, 
power analysis was performed to determine sample size required to detect signifi- 
cant differences with a power of f = 0.7 based on the observed differential signal in 
correct auditory and visual trials under baseline conditions. Analysis indicated that 
more than four independent samples would be required to detect a change in these 
differential responses. 
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Extended Data Figure 1 | Cross-modal task training and performance reduction in the error fraction (n = 15 mice,*P < 0.05, Wilcoxon rank-sum 


validation. Quantification of performance across training stages for the cross- __ test), column 2 shows the number of consecutive correct responses (P-values 
modal task. The trial sequence for each training stage is indicated on the left. | shown, Kolmogorov-Smirnov test) and bar graphs on the right (column 3) 
Improved performance was observed in the last three days of training relativeto | showthe probability of a correct response following a modality shift (*P < 0.05, 
the first three for each stage. Bar graphs on the left (column 1) show the Wilcoxon rank-sum test). 
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Single Mouse Data 
Mouse CM019: visual trials 


Mouse R3: reversal learning 
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Extended Data Figure 2 | Effects of cross-modal divided attention in the 
mouse. Top row, single-mouse examples of visual detection performance 
during cross-modal divided attention and reversal learning. Comparison of 
performance under visual-only (black) and cross-modal (green) conditions are 
shown on the left. Although neither condition contained sensory conflict, the 
mere expectation of one increased detection threshold (=124 trials per 


condition). Detection threshold was not affected by the presence of an auditory 
distractor during reversal learning (=90 trials per condition), as shown on the 
right. Middle and bottom rows, group data normalized to peak performance 
(lapse rate), showing that the effects of divided attention on detection threshold 
were persistent. Bootstrap estimation of visual detection threshold shows a 
similar pattern as data in Fig. 1 (error bars are 95% confidence intervals). 
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Extended Data Figure 3 | Comparable performance on trial types andintact auditory and maximum-intensity visual trials (n = 4 mice, same as in Fig. 1d). 
overall auditory performance despite auditory stimuli being eliminatedona _ Right, mice exhibited comparable overall performance when auditory stimuli 
subset of ‘attend to vision’ trials. Left, performance was comparable on were eliminated from a subset of ‘attend to vision’ trials. 
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Extended Data Figure 4 | Region- and timing-specific effects of optogenetic (4 VGAT-ChR2 and 4 VGAT-Cre), *P < 0.05 Wilcoxon rank-sum test). 
manipulation on cross-modal task performance. a, Optogenetic disruption  e—h, Photobleaching experiment to quantify the spread of laser light. A coronal 
of auditory cortex during target stimulus anticipation disrupted performance _ section (e) shows GFP bleaching following two-hour exposure to laser 
specifically for auditory trials (n = 4 mice, **P < 0.01, Wilcoxon rank-sum stimulation (6 mW, 50 Hz, 90% duty cycle). f-h, Fluorescence intensity 


test). Disruption of AAC (b) or lateral OFC (c) in VGAT-ChR2 mice or quantification shows that the extent of light spread is limited to 300 jm around 
following localized injection of a ChR2-expressing virus did not affect the tip of the optic fibre (n = 3 mice). Ant, anterior; FrA, frontal association 
performance (n = 4 mice (2 VGAT-ChR2 and 2 VGAT-Cre), 4 sessions per cortex; lat, lateral; LO, lateral orbitofrontal cortex; med, medial; MO, medial 
manipulation). d, In contrast, inactivation of prelimbic (PL) cortex led to orbitofrontal cortex; post, posterior; Stim, stimulation; Sham, sham surgery 
robust reduction in performance in both types of manipulation (n = 8 mice control; VO, ventral orbitofrontal cortex. Scale bar in e, 200 tum. 
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Extended Data Figure 5 | Independently adjustable, multi-electrode 
recording of visTRN neurons. a, b, Injection of DIO-ChR2-eYFP retrograde 
lentivirus into LGN labels visTRN neurons but not LGN interneurons. a, The 
histological image is the maximal projection of four 2-j1m confocal planes 
showing labelling of visTRN neurons (inset shows a zoom view of cell bodies) 
approximately 1.34 mm posterior to Bregma. b, Image as in a, but from LGN of 
the same animal, approximately 2.46 mm posterior to Bregma (inset shows a 
zoom view of terminals). c, Schematic of independently adjustable multi- 
electrode drive. d, An example of activity recorded from different depths during 
adjustment. Distinct patterns of physiological activity are observed along the 
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trajectory in the broadband local field potential signal (0.1 Hz-32 kHz). The 
numbers correspond to different recording sites (marked by the red dots on 
the schematic in c). e, High-pass-filtered signals (600 Hz-10 kHz) showing 
spiking activity, with isolated clustered units showing distinguishable 
waveform characteristics in distinct structures. f, Example peri-event time 
histograms of ChR2-mediated visTRN response. Top, response to laser 
activation (473 nm, ~4 mW, stimulation, 20 ms). Bottom, response to visual 
stimuli (10-ms pulse). Blue and orange blocks indicate laser and visual 
stimulation, respectively. 
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Extended Data Figure 6 | Distinct changes in visTRN firing rate during 

natural errors compared to errors due to PFC disruption. a-c, Scatter plots 
showing the change in absolute firing rate for visTRN neurons for correct (a), 
incorrect (b) or disrupted-PFC trials (c). Insets show the cumulative probability 
plot of separation from the unity line (no change). Although correct trials had a 


Firing rate (Hz), Attend to audition 


Incorrect 


300 
< 0.05 
100 e 
30 
. e 
10 e 
e o 
3 van 
£ a ee 
1 4 e@ 
2p e 
n= 138 
0.3 
03 «1 3. 10 30 100 


Firing rate (Hz), Attend to vision 


level). 


lower firing rate in ‘attend to vision’ than in ‘attend to audition’ trials (n = 138, 


©2015 Macmillan Publishers Limited. All rights reserved 


PFC disruption 


Oo 
g 300 p=0.56 
5 100 S 
2 ° 
£0). uf 
5 10 og 
z ss ie 
= 3 oom | 
oS g* * 
a] { ee 
e a n=56 
703 
£ 03 1 3 10 30 100 
ir 


Firing rate (Hz), Attend to vision 


P<0.001 Wilcoxon signed-rank test), this pattern was reversed for incorrect 
trials (n = 138, P< 0.05, Wilcoxon signed-rank test); this suggests that perhaps 
the animal was attending to the wrong modality. This reversal was not observed 
in trials with PFC disruption (despite mouse performance being at chance 
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Extended Data Figure 7 | The effect of PFC disruption on visTRN activity is 
distinct from naturally occurring errors. a, Scatter plots of response from 
visTRN neurons, comparing the modulation of their firing rate (change from 
baseline) under the two distinct anticipatory conditions. Each sample is a 
single cell. Colours denote significance reached for each cell on a trial-by-trial 
basis (red, visual; blue, auditory; purple, both; rank-sum-test comparison to 
baseline). Note that in correct performance (n = 138, 4 mice, P< 0.005, 
Wilcoxon signed-rank test), ‘attend to vision’ resulted in a negative shift and 


‘attend to audition’ resulted in a positive shift, consistent with examples shown 
in Fig. 3. During naturally occurring error trials, the modulation is partially 
reversed for both trial types, suggesting that at least a subset of errors are 

the result of attending to the wrong modality. In contrast, PFC disruption 

(n = 56 cells, 2 mice) resulted in a weaker, non-uniform effect (‘attend to visual’ 
trials are less affected). b, Quantification of effects seen in a. N.S., not 
significant. 
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(Fig. 2) resulted in maximal disruption of cross-modal performance. Activating 
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Extended Data Figure 9 | Attentional modulation by LGN is not observed _ signed-rank test, n = 161) among trial types when behavioural outcomes were 
on error trials. a, b, No significant difference was observed in the average incorrect. c, Similar effects were observed for VEPs (visual, n = 324 trials; 


firing rate of LGN neurons during stimulus anticipation (P = 0.63, Wilcoxon _ auditory, n = 302 trials; 4 mice). 
signed-rank test, n = 161 cells, 4 mice) or presentation (P = 0.74, Wilcoxon 
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Extended Data Figure 10 | Light-evoked fast transients from chloride 
photometry measured in the LGN are GABA,-receptor dependent and 
sensitive to visTRN and prelimbic inactivation in the cross-modal task. 

a, Peak SuperClomeleon FRET- and YFP-control responses to light stimuli 
(50 ms, 0.1 Hz) delivered to the eye contralateral to the recorded LGN (n > 90 
trials from 3 mice for SuperClomeleon and from 4 mice for YFP, ***P < 0.001, 
Friedman test). b, Chloride photometry transients are sensitive to the 

GABA, receptor antagonist flumazenil in a dose-dependent manner. Left, 
intraperitoneal injection of 15 mgkg™' flumazenil resulted in a 90% peak 
reduction of light-evoked chloride photometry responses, which recovered 
over the course of 90-100 min as predicted by flumazenil pharmacokinetics. 
Insets show example traces of single events recorded during baseline, peak 
suppression and recovery. Right, quantification of the maximal suppressive 
effects and recovery of 5mgkg™' and 15 mgkg™ flumazenil on chloride 


photometry responses (n> 90 trials from 3 mice, *P < 0.05, **P < 0.01, 
Friedman test). c, Cumulative distributions of unitary visual-evoked 
SuperClomeleon FRET peaks in response to light stimuli in the cross-modal 
task. Under baseline conditions, ‘attend to audition’ trials exhibited 
significantly larger amplitudes than ‘attend to vision’ trials, consistent with 
average data in Fig. 5f. Optogenetic silencing of visTRN neurons eliminated the 
difference between trial types and resulted in peak amplitudes comparable 

to baseline ‘attend to vision’ trials (n = 3 mice, P< 0.005 for ‘attend to audition’ 
trials vs all other trial types, Kolmogorov-Smirnov statistics with Bonferroni 
correction). d, Combined optogenetic and chloride photometry inactivation 
of different frontal cortical regions in the LGN while mice performed the cross- 
modal task. Only PL inactivation eliminates differential inhibition between 
visual and auditory trials (n = 6 mice, ***P < 0.001, Wilcoxon rank-sum test). 
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CMT2D neuropathy is linked to the neomorphic 
binding activity of glycyl-tRNA synthetase 


Weiwei He’, Ge Bai**, Huihao Zhou!, Na Wei', Nicholas M. White’, Janelle Lauer’, Huaqing Liu*, Yi Shi!, Calin Dan Dumitru’, 
Karen Lettieri’, Veronica Shubayev*, Albena Jordanova”, Velina Guergueltcheva®, Patrick R. Griffin?, Robert W. Burgess’, 


Samuel L. Pfaff? & Xiang-Lei Yang! 


Selective neuronal loss is a hallmark of neurodegenerative diseases, 
which, counterintuitively, are often caused by mutations in widely 
expressed genes’. Charcot-Marie-Tooth (CMT) diseases are the 
most common hereditary peripheral neuropathies, for which there 
are no effective therapies”’. A subtype of these diseases—CMT type 
2D (CMT2D)—is caused by dominant mutations in GARS, encod- 
ing the ubiquitously expressed enzyme glycyl-transfer RNA (tRNA) 
synthetase (GlyRS). Despite the broad requirement of GlyRS for 
protein biosynthesis in all cells, mutations in this gene cause a 
selective degeneration of peripheral axons, leading to deficits in 
distal motor function’. How mutations in GlyRS (GlyRS™"?”) 
are linked to motor neuron vulnerability has remained elusive. 
Here we report that GlyRS“"”? acquires a neomorphic binding 
activity that directly antagonizes an essential signalling pathway for 
motor neuron survival. We find that CMT2D mutations alter the 
conformation of GlyRS, enabling GlyRS“™"”” to bind the neuropi- 
lin 1 (Nrp1) receptor. This aberrant interaction competitively inter- 
feres with the binding of the cognate ligand vascular endothelial 
growth factor (VEGF) to Nrp1. Genetic reduction of Nrp1 in mice 
worsens CMT2D symptoms, whereas enhanced expression of VEGF 
improves motor function. These findings link the selective patho- 
logy of CMT2D to the neomorphic binding activity of GlyRS@™7” 
that antagonizes the VEGF-Nrp1 interaction, and indicate that the 
VEGF-Nrp1 signalling axis is an actionable target for treating 
CMT2D. 

CMT diseases are a group of inherited disorders that specifically 
affect the peripheral nervous system and are characterized by progress- 
ive weakness and atrophy in the hands and feet”’. Recent progress in 
neurogenetic studies has uncovered aminoacyl-tRNA synthetases as 
the largest gene family implicated in CMT. Among them, GARS, 
encoding GlyRS, was the first member identified, mutations in which 
cause a dominant axonal form of CMT (CMT2D)*. The canonical 
function of this evolutionarily ancient enzyme is to catalyse the ligation 
of glycine to the 3’ end of its cognate tRNA as the first step of protein 
synthesis. Interestingly, emerging evidence has revealed that GlyRS in 
multicellular organisms, like several other tRNA synthetase family 
members, has acquired the ability to be secreted from cells and as an 
extracellular protein can influence cell signalling*’. 

More than a dozen missense GARS mutations (GlyRS™?P) have 
been found in CMT2D patients, with varying degrees of genetic evid- 
ence for disease association’’ *. Among them, three mutations (E71G, 
L129P and G240R) are the most tightly linked to the disease"’ (Fig. 1a). 
Spontaneous and N-ethyl-N-nitrosourea (ENU)-induced missense 
mutations in mouse Gars also cause CMT2D-like phenotypes'*” 
(Fig. la). These dominant mutations are found throughout the 
primary sequence of GlyRS, with some affecting the aminoacylation 
enzyme activity whereas others do not’. Mice with a heterozygous 


deletion of the Gars gene and a 50% reduction in glycyl-tRNA synthe- 
tase activity are normal'*. Furthermore, overexpression of the wild- 
type GlyRS (GlyRS™”") in a CMT2D-disease mouse fails to rescue the 
neuropathy'’. These genetic experiments indicate that CMT2D may 
arise from an abnormal activity gained by GlyRS“™''”” rather than a 
general defect in tRNA aminoacylation as initially suspected. 

In common with most class II tRNA synthetases, GlyRS functions as 
a dimer for aminoacylation. Interestingly, despite being dispersed in 
three separate domains of GlyRS, all known CMT2D-causing muta- 
tions are located near the dimer interface in the GlyRS crystal struc- 
ture’’. We found that five different human mutations associated with 
CMT2D caused a conformational opening in GlyRS that exposes new 
protein surfaces to solution’” (Fig. 1 b). To test if this conformational 
change also occurs in GlyRS™'”? linked to CMT-like phenotypes in 
mice, we performed hydrogen-deuterium exchange labelling on 
GlyRS(P234KY) and found that the mouse mutation likewise opens 
new surfaces of the GlyRS protein to solution (Fig. 1c and Extended 
Data Fig. 1). These findings suggest that the abnormal ‘opening’ is 
shared by many CMT2D mutants. 
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Figure 1 | Dispersed CMT2D mutations consistently cause neomorphic 
structural opening at the dimer interface of GlyRS. a, Distribution of 15 
CMT2D-associated dominant mutations in the three domains of the cytosolic 
human GlyRS. The three strongest pathogenic mutations are highlighted in 
green. Two mutations identified in mice (asterisks) are labelled with their 
corresponding residue numbers in the human protein. b, Human GlyRS 
structure (monomeric subunit) viewed from dimer interface. Consensus 
opened-up areas caused by five CMT2D mutations are labelled in red’’. 

c, Opened-up areas (red) by the P234KY mutation (>10% increase in 
deuterium incorporation relative to wild-type GlyRS). 
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The new surfaces exposed by mutations in GlyRS™''”? could lead 
to neomorphic protein interactions. This prompted us to search for 
binding partners unique to GlyRS“"””. We performed a candidate- 
protein screen by in vitro protein pull-down assays. Because motor 
neurons are the most frequently affected neuronal type in 
CMT2D*"*”, our initial screen focused on molecules that are highly 
expressed by motor neurons and that have been linked to motor neu- 
ron diseases or defects. We detected strong binding between the recep- 
tor Nrp] and several GlyRS“™"*? mutants including P234KY and the 
three (E71G, L129P and G240R) with the strongest link to CMT2D in 
patients! (Fig. 2a and Extended Data Fig. 2a, b). In contrast, GlyRS”" 
did not bind Nrp1 strongly, and GlyRS“''”” failed to bind to other 
motor neuron proteins such as TrkB (also known as Ntrk2), Dec, 
Robol and Unc5C (Fig. 2a and Extended Data Fig. 2a, b). To confirm 
that this binding specificity occurs in vivo, we performed immunopre- 
cipitations using neural tissues from wild-type and P234KY-mutated 
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significantly more GlyRS from CMT2D mice than wild-type 
controls (Fig. 2b), indicating that GlyRS“1??(P234KY) binds to 
Nrp1 in vivo. Furthermore, we also found that the GlyRS-NRP1 inter- 
action is significantly stronger in CMT2D patients carrying the L129P 
mutation than in healthy individuals (Fig. 2c). To quantify these 
interactions, we used biolayer interferometry and a biosensor with 
immobilized Nrp1 on the surface. GlyRS“" binding was undetectable 
at 1M, whereas GlyRS“™'”? bound significantly more strongly, 
with a dissociation constant (Ka) of 29.8+6.3nM for L129P and 
208.7 + 53.8 nM for P234KY. 

Next, we mapped the site where GlyRS“™'?? binds to Nrp1 using 
pull-down assays with domain-deletion constructs. Removal of the 
extracellular a and c domains of Nrp1 did not alter GlyRS™"”” bind- 
ing, whereas the extracellular Nrp1 b1 domain alone was sufficient to 
bind GlyRS™"??(P234KY) (Fig. 2d and Extended Data Fig. 2c). 
Because the b1 domain is the binding site of VEGF-Aj¢5, this finding 
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Figure 2 | GlyRS“!”” specifically binds Nrp1 and antagonizes VEGF- 
Nrp1 interaction. a, In vitro pull-down (PD) of GlyRS™”” proteins by the 
ectodomain of Nrp1 but not TrkB. b, Co-immunoprecipitation (IP) to detect 
GlyRS-Nrp1 interaction in neural tissues of wild-type (WT) and P234KY- 
MTP mice (CMT). ¢, Co-immunoprecipitation to detect GlyRS-Nrp1 
interaction in lymphocytes from CMT2D patients carrying the L129P mutation 
(n = 5) and from healthy individuals (n = 3). d, Domain mapping using 

in vitro pull-down identifies the b1 domain of Nrp1 as the main binding site of 
GlyRS™?”. e, f, In vitro pull-down assay showing the competition between 


GlyRS“?P (P234KY) and VEGF-Aj¢5 proteins for Nrp1 (b domains) binding. 
g, j, Schematic of facial motor neuron migration (g) and facial motor nucleus 
(j) in open-book preparations of wild-type (left half) and VEGF-Nrp1- 
deficient mouse hindbrains at E13.5 (right half)”°. h, i, Fluorescence labelling of 
facial motor neuron somata and axons by Is!“:GFP-F on one side of E13.5 
mouse hindbrain of open-book preparation. k, 1, Immunostaining of Isl- 
positive facial nucleus on one side of the E13.5 mouse hindbrain of open-book 
preparation. Scale bars, 200 jim. 
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VEGF-Aj¢5 to this region of Nrp1. Using pull-down assays, we found 
that increasing concentrations of P234KY or L129P GlyRS™?? com- 
pete with VEGF-Aj¢5 binding to the b domains of Nrp1 (Fig. 2e and 
Extended Data Fig. 2d). ). Conversely, increasing levels of VEGF-Aj65 
displaced P234KY or L129P GlyRS~™'"”? from the b domains (Fig. 2f 
and Extended Data Fig. 2e). 

These observations led us to focus on GlyRS 5 protein in the extra- 
cellular environment. Recent studies of GlyRS”' detected secretion 
from immune cells®, so we first examined whether GlyRS™’" is released 
by cell types relevant to the peripheral nervous system and motor 
function. Indeed, endogenous GlyRS“’" was detected in the culture 
media of mouse motor neuron and differentiated myotube cell lines, 
but not of undifferentiated myoblasts (Extended Data Fig. 3a-e). 
We found that secreted GlyRS“" was enriched from extracellular 
sources using procedures that concentrate micro-vesicles (30-100 nm, 
‘exosomes’) (Extended Data Fig. 4a, b). Extracellular levels of Glyrs“t 
were diminished by application of the exosome-pathway inhibitor 
GW4869 and enhanced by the exosome-pathway activator monensin 
(Extended Data Fig. 3a—f). Next, we checked whether CMT2D-causing 
mutations affect the secretion of GlyRS“™"””. Our results showed that 
GlyRS™ 7? (P234KY) was detected at levels similar to GlyRS“ i in the 
media of transfected cells (Extended Data Fig. 3g). 

Nrp1 is a well-established receptor needed for motor neuron axon 
guidance and cell body migration”®”". Our finding that GlyRS“™'’P 
and VEGF-Aj65 compete for access to Nrp1 raised the possibility that 
CMT2D mice may phenocopy some features of VEGF-Aj64 (the 
mouse equivalent of human VEGF-Aj,5) and Nrp1 mutant mice”. 
VEGF-Nip1 signalling is necessary for the caudal migration of facial 
motor neurons from rhombomere (r)4 to r6 during embryonic 
development (Fig. 2g, j)”*. This provided us with an excellent in vivo 
assay to examine the effect of GlyRS“'*? on a well-characterized 
system known to depend on VEGF-Nrp1 signalling. To track facial 
motor neuron migration we crossed CMT2D mice to transgenic 
IsIM“:GFP-F reporter animals that selectively label facial motor 
neurons’, We found that CMT2D mutant embryos developed at a 
normal rate, appeared overtly normal based on their overall morpho- 
logy, and that the expression levels of a variety of neuronal proteins 
were comparable to controls (Extended Data Fig. 5a, b). However, 
green fluorescent protein (GFP)-labelled facial motor neuron somata 
were found in ectopic anterior rhombomere locations in embryonic 
day (E)13.5 GlyrsoM1?P mutant embryos and manifested as an elon- 
gated stream across multiple rhombomeres (Fig. 2i). In contrast, most 
facial cells had completed their caudal migration to r6 in littermate 
controls at this stage (Fig. 2h). This observation was confirmed by 
immunostaining with the LIM homeodomain transcription factor Isl 
that is selectively expressed in the nuclei of facial branchiomotor neu- 
rons (Fig. 2k, | and Extended Data Fig. 5c). This facial motor neuron 
migration defect closely resembles the phenotypes of Nrp1-null 
and VEGF-Aj¢4-null mice, as previously reported (Fig. 2j)*°. 
Taken together, the protein-binding studies and embryol logical defects 
associated with GlyRS“”*? suggested that GlyRS“™’*? inhibits 
VEGF-Nip!1 signalling. 

VEGF signalling is thought to protect neurons from a variety of 
damaging insults”. Intriguingly, deficient VEGF signalling leads to 
the selective degeneration of motor neurons in mice”. To examine 
whether the VEGF-Nrp1 pathway is involved in motor deficits that 
arise from GlyRS mutations, we tested whether a genetic interaction 
could be detected between Gars™"? and Nrp1. Although the data 
described earlier suggest that GlyRS“™'”? attenuates normal Nrp1 
signalling, it is still possible that GlyRS“™"?? might activate aberrant 
signalling through Nrp1 and cause motor defects. In the first case 
motor phenotypes should get more severe as Nrpl gene dosage is 
reduced in CMT2D mice, and in the second case motor phenotypes 
should improve. To test these possibilities, we intercrossed Gars@M "7? 
mice with Nrp1 heterozygous (Nrp1*’~) animals and characterized 
the motor behaviour of the single- and double-mutant offspring. At 2 
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weeks, when motor behavioural changes were not observed in either 
GarseM™P or Nrpl */~ mutant mice, 20% of the compound hetero- 
zygous Gars™™??/Nrp1*/~ mutant mice had developed neuromus- 
cular dysfunction based on a hindlimb extension test (Extended Data 
Fig. 6a, b). At 4 weeks, CMT2D-like symptoms, including overt neu- 
romuscular dysfunction and an altered walking stride, become appar- 
ent in Gars'?? mutants, whereas Nrp1*’~ mice appeared normal 
(Fig. 3a-d and Supplementary Videos 1, 2). Strikingly, by 4 weeks, 50% 
of the Gars™"?/Nrp1*’~ mutant mice had entirely lost the ability to 
spread their legs and toes (Fig. 3a, b), and exhibited severely abnormal 
gait patterns (Fig. 3c, d and Supplementary Video 3). After postnatal 
week 4, Gars'’P/Nrp1‘’~ mutant mice began to die. Consistent 
with our biochemical studies showing that GlyRS“™"”? binds poorly 
to other signalling receptors (see Fig. 2a and Extended Data Fig. 2a), 
intercrosses between GarsM?” and TrkB*’”, Dec*’", Robo1*’~ and 
Unc5C*’~ heterozygous mice did not worsen the neuromusclular 
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Figure 3 | Nrp1 is a genetic modifier of CMT2D. a, b, Hindlimb extension 
test at 4 weeks. ***P < 0.001 (Mann-Whitney test). CMT, P234KY- 
Gars™P mice. c, Hindlimb footprints of wild-type and mutant animals at 
4 weeks. Gars“M™??/Nrp1*’~ mutant mice exhibit disrupted gait patterns 

of different degrees (mild, severe). Note that severe cases show inability to walk. 
d, Stride length of wild-type and mutant animals at 4 weeks. e, f, Neuromuscular 
junctions immunostaining (NMJ) in the gastrocnemius muscles of 4-week-old 
mice with the motor nerve terminal labelled in green and acetylcholine receptors 
on the muscle labelled in red. Data are presented as mean values + standard error 
of the mean (s.e.m.). # = 3 mice per group. Scale bar, 50 um. g, h, Myelinated 
axons from sciatic nerves of 4-week-old mice. Scale bar, 20 um. h, Histogram 
showing the quantification of axon numbers with the diameter larger than 2 jim. 
n= 3 mice per group. d, f, h, *P< 0.05, **P < 0.01 (t-test). 
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phenotypes in the compound heterozygotes (Fig. 3b and Extended 
Data Fig. 6c). 

The motor defects in Gars“?? and Gars™'??/Nrp1*/~ mutants 
were accompanied by marked pathological changes in the peripheral 
nerves and synaptic contacts with muscle fibres. Neuromuscular junc- 
tions (NMJs) displayed a normal apposition of nerve fibres and post- 
synaptic acetylcholine receptors in wild-type and Nrp1*’~ animals, 
while partially innervated and completely denervated NMJs were pre- 
sent in 4-week Gars™'? mutants (Fig. 3e, f). The loss of nerve 
terminals at NMJs was markedly increased in Gars“M'??/Nrp1*/— 
mutants (Fig. 3e, f). Likewise, at 4 weeks of age many large-diameter 
axons were absent from the sciatic nerves of Gars“M’*? mutants 
compared with wild-type and Nrp1*’~ littermates (Fig. 3g, h and 
Extended Data Fig. 7a—c). The absence of large-diameter axons was 
even more dramatic in 4-week Gars“™P/Nrp1*’— compound 
mutants, and was comparable to the extreme axonal dystrophy 
observed in late-stage CMT2D mutants (Fig. 3g, h and Extended 
Data Fig. 7d)'°. These findings demonstrate that Nrp1 is an impor. 
tant genetic modifier of CMT2D pathology and that GlyRS““?? 
antagonizes normal Nrp1 signalling rather than activating aberrant 
signalling. 

These findings prompted us to test whether VEGF overexpression 
could counteract GlyRS“™’”? and help to slow the loss of motor 
function in CMT2D mice. A lentiviral vector encoding either VEGF- 
Ajos or GFP was injected bilaterally into the hindlimb muscles of 
Gars™P mutant mice at postnatal day 5, before the onset of overt 
motor defects (Fig. 4a and Extended Data Fig. 8). By 4 weeks of age, we 
began to observe a reduction of limb strength in the control GFP- 
treated Gars'?P animals using an inclined plane test (Fig. 4b). 
However, VEGF-A j65-treated animals retained greater neuromuscular 
capacity with significantly higher scores (Fig. 4b). By 7 weeks, 
Gars™'?? animals exhibited a disrupted gait pattern with shortened 
hindlimb stride length, while VEGF-Aj,5-treated animals maintained 
a significantly longer walking stride (Fig. 4c). Likewise, VEGF treat- 
ment significantly improved the motor performance of Gars“MT?? 
mutants in the rotarod test (Fig. 4d). To minimize the possible influ- 
ence from the natural variation in disease progression among indi- 
vidual animals, we compared the effect of VEGF treatment to GFP 
controls by separately treating each hindlimb from the same animal. 
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Figure 4 | VEGF treatment improves motor function in CMT2D mice. 

a, Diagram showing bilateral intramuscular injection of lentivirus (LV) into 
mouse hindlimbs at postnatal day (P)5. b, Inclined plane test of 4-week-old 
animals. c, Walking strides of 7-week-old animals. d, Rotarod test of 2-month- 
old animals. *P < 0.05, **P < 0.01, ***P < 0.001 (t-test). 
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Lentiviral vectors encoding VEGF-Aj¢s5 or GFP were injected unilat- 
erally into each hindlimb of the same Gars“'”? mutant mouse at 
postnatal day 5 (Extended Data Fig. 9a). At 5 weeks, GFP-treated 
hindlimbs developed severe muscle weakness, largely losing their abil- 
ity to extend. In contrast, the contralateral hindlimbs treated with 
VEGF-Aj¢5 retained significant function (Extended Data Fig. 9b-e). 
These results suggest that VEGF treatment significantly ameliorates 
the loss of motor function in CMT2D mice. 

A number of neurotropic factors have been tested as broad- 
spectrum strategies to enhance neuronal survival and treat motor 
diseases***°. This raised the possibility that VEGF might slow the 
progression of CMT2D pathology by functioning as a generic trophic 
factor rather than as a specific agent to restore normal VEGF-Nrp1 
signalling. Therefore, we tested the effects of lentivirus-mediated 
expression of GDNF, a potent neurotrophin that has been used to 
enhance motor function and survival in mouse models of amyotrophic 
lateral sclerosis”®”’. Unlike VEGF-Aj6;, we found that GDNF failed to 
slow disease progression in Gars“'*P mice (Fig. 4b, d and Extended 
Data Fig. 9f). Next, we tested whether VEGF-mediated motor sparing 
is dependent upon Nrp1 binding, by exploiting the binding specificity 
of VEGF protein isoforms. VEGF-A, has overlapping functions with 
VEGF-Aj65 but lacks high-affinity Nrp1 binding’*. We found that 
VEGF-A, , treatment failed to ameliorate the loss of motor function 
in CMT2D animals (Fig. 4b, d and Extended Data Fig. 9g). These data 
support a model in which VEGF treatment helps to guard against the 
motor loss arising from the aberrant activity of GlyRS“™"”” by restor- 
ing VEGF-Nip1 signalling. 

Our results identify the Nrp1 gene as an important genetic modifier 
for CMT2D, and link the selective pathology of this disease to the 
neomorphic binding of GlyRS“™'”? to the receptor Nrp1. Although 
we found the same neomorphic binding to NRP1 in several human 
GlyRS“™”P mutants that are strongly disease associative (Fig. 2a and 
Extended Data Fig. 2a, b), we cannot rule out the possibility that some 
GlyRs™”?P mutants may interact with other extracellular and/or 
intracellular targets. Nevertheless, our findings strongly suggest that 
the VEGF-NRP1 pathway is an actionable target for treating CMT2D 
(Extended Data Fig. 10). While the exact role of VEGF in the motor 
system remains poorly defined, VEGF-deficient mice selectively 
develop symptoms of motor neuron disease over time’’. The direct 
antagonism of VEGF-Nrp1 signalling by GlyRS“'*P found here 
further indicates that deficient VEGF signalling may represent a com- 
mon pathogenic pathway that is susceptible to abnormal activity in 
other motor neuron diseases. A broad implication from this work is 
that the molecular basis of selective neuronal vulnerability in neuro- 
degenerative diseases may arise from the neomorphic activity of mis- 
folded proteins interacting with susceptible signalling targets in 
specific cell types. This conceptual framework may be applied for 
identifying additional druggable targets to treat neurodegenerative 
diseases, including other forms of CMT. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Mice. The following strains of mice were used in this study: wild-type C57BL/6] 
(JAX), P234KY-CMT2D mutant (background includes a mix of C57BL/6, CB6 
and CAST)", Tg (Hb9:GFP)", Tg (IslM“:GFP-F), Nrp] mutant”, TrkB 
mutant*’, Robol mutant**, Dcc mutant*®, and Unc5C mutant®’. Both male and 
female mice were used in this study. All experiments were done in accordance with 
Institutional Animal Care and Use Committee animal protocols and BSL2+ safety 
protocols, on animals housed in groups on a 12-h light-dark cycle. 
Recombinant GlyRS expression and purification. Carboxy-terminal His-tagged 
human GlyRS“" and GlyRS“™””? proteins were individually cloned into pET21b 
vector (Novagen) and expressed in Escherichia coli BL21 (DE3) host cells at 25 °C. 
The proteins were purified by Ni-NTA agarose affinity column followed by ion- 
exchange monoQ column and size-exclusion column Superdex 200 (GE 
Healthcare). To prepare non-tagged human GlyRS“” and GlyRS“™”? proteins, 
the GARS gene was fused with an amino-terminal His-SUMO tag, cloned into 
pET28a vector (Novagen), and expressed as His-SUMO-GlyRS fusion proteins in 
Escherichia coli BL21 (DE3) cells. The fusion proteins were purified with a Ni- 
NTA agarose affinity column, and then subjected to homemade Ulp1 protease to 
remove the His-SUMO tag. The non-tagged GlyRS proteins were separated from 
the tag by flowing through the Ni-NTA column again. 

Hydrogen-deuterium exchange analysis. Solution-phase amide hydrogen-deu- 
terium exchange (HDX) was performed with a fully automated system as 
described previously*’. Briefly, 4 il of His-tagged GlyRS??***Y or GlyRS“’" was 
diluted to 20 ul with D,O-containing HDX buffer to a final concentration of 
101M, and incubated at 4°C for 10, 30, 60, 900, and 3,600 s. Following on- 
exchange, unwanted back exchange was minimized by adding 30 pl of 1% TFA 
in 5 M urea to denature the protein (held at 1 °C). Samples were then passed across 
an immobilized pepsin column (prepared in house) at 50 pl min™! (0.1% TFA, 
15°C), and the resulting peptides were trapped onto a C8 trap cartridge (Thermo 
Fisher, Hypersil Gold). Peptides were eluted across a 1 mm X 50mm C18 high- 
performance liquid chromatography (HPLC) column (Hypersil Gold, Thermo 
Fisher) with a 4-40% CH3CN gradient and 0.3% formic acid over 5 min at 2 °C, 
and electrosprayed directly into an Orbitrap mass spectrometer (LTQ Orbitrap 
with ETD; Thermo Fisher). Data were processed with in-house software* and 
visualized with PyMOL (DeLano Scientific). The difference in HDX between 
Glyrs’?4“¥ and GlyRS”" was calculated by subtracting the average percentage 
deuterium uptake for GlyRS? 234KY from that for GlyRs“7 after 10, 30, 60, 300, 900 
and 3,600 s of on-exchange. Please note that we cannot directly compare the 
numbers obtained here for GlyRS?’?™*** with those for other GlyRS““"”? mutants 
from the previous study’’, because this and the precious analysis were carried 
out in two different laboratories with different instruments and experimental 
procedures. 

Detection of GlyRS proteins in cell cultures. NSC-34 motor neuron cells 
(Cellutions Biosystems) and C2C12 mouse adherent myoblasts (from A. 
Patapoutian’s laboratory at The Scripps Research Institute) were maintained in 
DMEM supplemented with 10% heat-inactivated fetal bovine serum (FBS) and 1% 
penicillin (Life Technologies) at 37 °C in a humidified incubator containing 5% 
CO,. These cell lines had not been recently authenticated and tested for myco- 
plasma contamination. Myogenic differentiation of C2C12 myoblasts was induced 
by substituting the FBS with 2% horse serum. The cells were further cultured 
in Opti-MEM Reduced Serum Medium (Life Technologies) for 16h. Brefeldin 
A (Cell Signaling Technology), GW4869 (Sigma-Aldrich), or monensin 
(eBioscience) was added to the cell medium for 2 h before the cells and the medium 
were separated for western blot analysis. After removal of cell debris by spinning 
the medium at 300g for 10 min, the supernatant was concentrated using Amicon 
Ultra-4 Centrifugal filter (Millipore). Cells were lysed using cell lysis buffer 
(ATCC) with added protease inhibitor cocktail (Roche). The following antibodies 
were used for western blot analysis: mouse anti-GlyRS (H00002617-B01P, 
Abnova; 1:1,000), rabbit anti-GAPDH (#3683, Cell Signaling Technology; 
1:1,000), rabbit anti-vWF (sc-14014, Santa Cruz; 1:50), and rabbit anti-TSG101 
(MABC649, Millipore; 1:1,000). To study the effect of CMT2D-causing mutation 
on GlyRS secretion, constructs overexpressing V5-tagged GlyRS??***Y or 
GlyRS“ were transfected into COS7 cells using lipofectamine 2000 
(Invitrogen). The expression and secretion of GlyRS proteins were detected by 
western blot analysis using anti-V5 antibody (R960-25, Invitrogen; 1:5,000). 
Exosome purification and analysis. The general idea of exosome purification by 
differential centrifugation is depicted in Extended Data Fig. 3a. Supernatants from 
NSC-34 cell media were subjected to successive centrifugation steps at 4 °C: (1) 
200g for 10 min to eliminate floating cells; (2) 2,000g for 10 min to discard large 
dead cells; (3) 10,000g for 1h to remove cell debris and cellular organelles such as 
mitochondria and lysosomes. At each step, the pellet was thrown away and the 
supernatant was used for the following step. The final supernatant was centrifuged 
at 100,000g at 4 °C for 4h to pellet micro-vesicles that are commonly known as 
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‘exosomes’. The final supernatant and the exosome fraction were analysed by 
western blot analysis using the antibodies specified earlier and rabbit anti-Bip 
antibody (#31838, Cell Signaling; 1:1,000). 

In vitro pull-down assay. Recombinant rat Nrp1-Fc, mouse TrkB-Fc, mouse 
Dec-Fc, rat Robol-Fc and human Unc5c-Fc extracellular domain—Fe chimaeras 
(R&D systems) were bound to Protein G beads. Purified non-tagged GlyRS™" and 
GlyRS™”? proteins were individually added to the receptor-immobilized beads 
and incubated for 1h at 4°C. After removal of unbound GlyRS proteins, SDS- 
loading buffer was directly added to the beads to elute the receptor and its bound 
GlyRS. The amount of GlyRS bound to receptors was analysed by western blot 
analysis using mouse anti-GlyRS antibodies (H00002617-B01P, Abnova; 1:1,000). 
Co-immunoprecipitation and western blot analysis using mouse tissues. 
The interaction between endogenous GlyRS and Nrp1 proteins was detected by 
co-immunoprecipitation. Adult mouse neural samples were lysed using RIPA 
buffer (Cell Signaling) containing 20mM Tris-HCl (pH 7.5), 150mM NaCl, 
1mMNa,EDTA, 1mMEGTA, 1% NP-40, 1% sodium deoxycholate, 2.5 mM 
sodium pyrophosphate, 1 mM £-glycerophosphate, 1 mM Na3VO,, and 1 pg ml 
leupeptin. Immunoprecipitation was performed with rabbit anti-Nrp1 antibody 
(NBP1 40666, Novus Biologicals; 1:100) and the precipitates were subjected to 
western blot analysis using mouse anti-GlyRS antibody (H00002617-B01P, 
Abnova; 1:1,000). 

Western blot was performed to analyse the expression levels of various neuronal 

proteins in E12.5 wild-type and CMT2D mutant neural tissues. The following 
primary antibodies were used: mouse anti-GlyRS (H00002617-B01P, Abnova; 
1:1,000), rabbit anti-Nrp1 (#37258, Cell Signaling; 1:1,000), rabbit anti- VEGFR1 
(#36-1100, Life Technologies; 1:1,000), rabbit anti-VEGFR2 (#24798, Cell 
Signaling; 1:1,000), mouse anti-B-actin (#3700, Cell Signaling; 1:1,000), mouse 
anti-Dcc (AF5, Abcam; 1:100), rabbit anti-Robol (NB100-60458, Novus 
Biologicals; 1:2,000), mouse anti-NF (2H3, DSHB; 1:100), mouse anti-MAP2 
(MAB364, Millipore; 1:500), rabbit anti-GAP43 (AB5220, Millipore; 1:1,000), 
and rabbit anti-B-catenin (#9587, Cell Signaling; 1:1,000). 
Co-immunoprecipitation using CMT2D patient samples. Peripheral blood was 
drawn from CMT2D patients carrying the L129P mutation and control indivi- 
duals after obtaining their written informed consent. The study complies with the 
ethical guidelines of the Medical University of Sofia, Bulgaria and University of 
Antwerp, Belgium. Lymphocytes were isolated on a Ficoll-Paque gradient, trans- 
formed with Epstein-Barr virus and incubated at 37°C for 2h. After centrifu- 
gation, cells were re-suspended in 4ml RPMI complete medium (Life 
Technologies) supplemented with 1% phytohaemagglutinin. Cells were seeded 
on a 24-well plate and incubated at 37°C, 6% CO, for a minimum of 3 days. 
After establishment, cell lines were cultivated in RPMI1640 medium containing 
15% fetal calf serum, 1% sodium pyruvate, 1% 200 M L-glutamine and 2% peni- 
cillin/streptomycin. The harvested lymphoblastoid cells were lysed using RIPA 
buffer (Cell Signaling). Immunoprecipitation was performed with rabbit anti- 
Nrp1 antibody (Novus Biologicals) and rabbit anti-IgG (#2729, Cell Signaling) 
and the pull-down samples were subjected to western blot analysis using rabbit 
anti-GlyRS antibody (sc-98614, Santa Cruz Biotechnology; 1:500). 
Mapping of GlyRS©“” interaction domain on Nrp1. The variants of Nrp1 
extracellular domain (ECD) include intact ECD (residues Arg 23—Asp 840), b1b2c 
domain (residues Phe 273-Asp 840), ala2 domain (residues Arg 23-Asp 272), 
b1b2 domain (residues Phe 273-Phe 643), c domain (residues Thr 589-Asp 840), 
b1 domain (residues Phe 273-Asp 428) and b2 domain (residues Lys 425-Phe 
643). These variants were designed as chimaera proteins containing a 17-residue 
secretion signal peptide from myeloid cell surface antigen CD33 (gp67) at the N 
terminus and a human IgG Fc domain at C terminus, and were expressed using 
pcDNA6.0/V5-His-A vector (Life Technologies). For each Nrp1 variants, 3 1g 
plasmids were transfected using Lipofactmine 2000 (Life Technologies) into 
human HEK293 cells in a 6-well plate. Twenty hours after transfection, MEM 
media containing secreted Nrp1 variants were collected and incubated with 30 pl 
Protein A resins. The Nrpl-bound resins were divided equally into two 1.5 ml 
Eppendorf tubes and incubated with 51g of recombinant GlyRS“'?? or 
GlyRS“” in 1 ml of Washing Buffer (PBS, 5mM B-ME, 0.2% BSA and 0.05% 
Triton X-100) for 1h. Resins were then washed three times with the Washing 
Buffer and once with PBS. The bound proteins were eluted with 30 pl of SDS- 
PAGE sample buffer and subjected to western blotting analysis using mouse anti- 
GlyRS (H00002617-B01P, Abnova; 1:1,000) and rabbit anti-His antibodies (RHIS- 
45P-Z, ICL Lab; 1:10,000) to detect GlyRS and the Nrp1 variants, respectively. 

The bl (residues Phe 273-Asp 428), b2 (residues Lys 425-Glu 586), and b1b2 
domain (residues Phe 273-Glu 586) of Nrp1 fused with an N-terminal glutathione 
S-transferase (GST) tag was cloned into the pET28a vector (Novagen), expressed 
in E. coli BL21(DE3) cells and purified with GST resin (Qiagen). GST or 
GST-Nrp1 fusion proteins was incubated with 20] GST resin and then bind 
with non-tagged wild-type or P234KY GlyRS in 1 ml of Washing Buffer (1x PBS, 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


5mM BME, 0.2% BSA and 0.05% Triton X-100) for 1h. GST resins were 
washed three times with Washing Buffer and once more with PBS. The bound 
proteins were eluted with SDS-PAGE sample buffer, and subjected to western 
blotting analysis. 

Competition assay between VEGF-Aj,5 and GlyRS™”” for Nrp1 binding. In 
each experiment, 5 ig of GST-b1b2 protein was bound with 15 yl of GST resin in 
1 ml Washing Buffer on ice. The competition was tested in both directions. In one 
direction, 30nM of P234KY GlyrSoM?P was added to GST-b1b2 with an 
increasing concentration of human VEGF-Aj¢5 (IBL); in the opposite direction, 
30nM of VEGF-Aj¢5 was added with an increasing concentration of P234KY 
GlyRS“P. After the resins were washed three times with the Washing Buffer 
and once with PBS, proteins were eluted with SDS-PAGE sample buffer, and 
analysed by western blot using rabbit anti- VEGF-A (ABS82, Millipore; 1:2,000), 
mouse anti-GlyRS (H00002617-B01P, Abnova; 1:1,000) and rabbit anti-GST 
(#2622, Cell Signaling; 1:1,000) antibodies. 

Immunostaining and imaging. Immunostaining of NMJs was performed as 
described'*. Cocktails of the following primary antibodies were used to visualize 
nerves: rabbit anti-NF (AB1991, Millipore; 1:1,000), rabbit anti-synaptophysin 
(A0010, Dako; 1:2,000), and mouse anti-SV2 (DSHB; 1:1,000). Secondary anti- 
bodies were Alexa-488 or -647 conjugated (Molecular Probes/Invitrogen; 1:1,000). 
Tetramethylrhodamine-conjugated o-bungarotoxin (T-1175, Molecular Probes/ 
Invitrogen; 1:1,000) was used to visualize acetylcholine receptors (AChRs) on muscles. 
The occupancy of NMJs is measured by examining the overlap of the motor nerve 
terminal (green) with AChRs on the muscle (red). At least 40 randomly selected NMJs 
were examined from each of three mutant and three control mice. The flat-mount 
preparations of hindbrains were performed as previously described”*. Rabbit anti- 
Isl1/2 (ref. 39) was used to label facial motor nuclei by whole-mount immunostaining. 
The distance between the facial motor nucleus and trigeminal nucleus was measured 
for each embryo. Each distance was further normalized to the relative distance of the 
wild-type facial motor nucleus. Rabbit anti-VEGF (ab52917, Abcam; 1:200) was used 
to determine the expression of VEGF in muscle fibres. 

Bright-field and fluorescence images of whole embryos were obtained using a 
0.8X objective on Zeiss Lumar.V12 fluorescence stereomicroscope. Confocal 
images were obtained using 10X and 20X objectives on Olympus Fluoview 
1000 confocal microscope. 

Hindlimb extension test. Mice were suspended by the tail and the extent of 
hindlimb extension was observed over 10 s. A score of 2 corresponded to a normal 
extension reflex in hindlimbs with splaying of toes. A score of 1 corresponded to 
clenching of hindlimbs to the body with partial splaying of toes. A score of 0 
corresponded to clasping hindlimbs with curled toes. Three tests were performed 
for each mouse with 5 s intervals. A score of 1.5 or 0.5 corresponded to behaviours 
between 2 and 1, or between 1 and 0, respectively. 

Footprint test. Blue ink was applied to the hind paws of each mouse and the 
animal was placed in a narrow alley (9 X 80 X 25 cm) with the floor covered with 
white paper. A home cage was placed at the end of the alley for the animal to walk 
to while leaving its footprints on the paper. Stride length was assessed by mea- 
suring the average distances of at least three consecutive steps on each side. 
Inclined plane test. Hindlimb strength was assessed at postnatal 4 weeks using the 
inclined plane test. Briefly, animals were placed on an inclined plane, and the angle 
of incline was gradually increased starting from 15°. The maximum angle at which 
the animal could maintain its position for 5 s constituted the inclined plane score. 
The test was performed three times for each mouse. 

Rotarod test. Motor coordination was assessed with a rotarod apparatus 
(Economex, Columbus Instruments). The mice were first placed on the stationary 
rod (0 r.p.m.) to acclimate them to the apparatus, followed by a trial at a rotation 
speed of 1 r.p.m. for 3 min or until a fall occurred. For testing, the rotation of the 
rotarod was accelerated from 0 r.p.m. with an accelerating rate (0.1 r.p.m. min’). 
The latency of each mouse to fall was monitored for three consecutive trials and 
the intra-trial interval for each animal was about 20 min. The average time of three 
trials was used as a measure of motor performance. 


Virus preparation and injection. The complementary DNAs encoding GDNF, 
VEGF-A;3; or VEGF-Aj¢s were cloned into lentiviral vector (p156RRLsin 
PPTCMVGFPPRE) between BamHI and Sall sites. All lentiviruses were produced 
by the GT3 core facility at the Salk Institute with a titre of 1 x 10'7-2 x 10° 
genome copies per ml. Injections were performed at P5 (+ 1 day) after anaesthet- 
izing pups on ice. Multiple injections (n = 8) of virus (5 pil for each limb) into a 
variety of hindlimb muscles were performed with a Hamilton syringe. On the basis 
of the expression pattern of GFP reporter, we found that the lentiviruses mainly 
infect muscle fibres. 

Nerve histology and imaging. Mouse sciatic nerves were dissected and fixed in 
2.5% glutaraldehyde in 0.1 M phosphate buffer. Nerve samples were then osmi- 
cated, dehydrated and embedded in araldite resin. Transverse nerve sections 
(1 um) were cut on a Leica RM2065 microtome and stained with methylene blue 
Azure II. Images were collected on a Leica DMR microscope or an Olympus BX61 
microscope. Axon numbers were determined from two non-overlapping fields 
(50 X 50 jm) from each of three mutant and three control nerve samples. Axon 
diameters were measured by Image J. 

Statistics. All graphs and data generated in this study were analysed using 
GraphPad Prism 6.0 Software (MacKiev) or Excel (Microsoft). Two-tailed 
unpaired t-tests with Welch’s correction using parametric distribution, two-tailed 
Mann-Whitney test using unparametric distribution, or two-tailed paired 
Wilcoxon test using unparametric distribution were performed to measure differ- 
ences from at least three independent biological replicates. P< 0.05 was consid- 
ered significant. These tests do not require similar variance of the data between the 
groups that are being statistically compared. The normality of the data was deter- 
mined by D’Agostino-Pearson omnibus test and Kolmogorov-Smirnov test. No 
statistical methods were used to predetermine sample sizes, but our sample sizes 
are similar to those generally used in the field. 

For all animal studies, analyses were performed on approximately equal num- 
bers of male and female mice selected randomly from populations, and no sex- 
specific differences in the disease progression were identified. All behavioural 
experiments were performed in double-blind fashion, and stressed animals were 
excluded from the analysis. 
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Catalytic Domain—¢——-_— Anticodon Binding Domain 
VREGDEQRTF 33'S FPAVVAPFsqKC SVLPLSQN5nQ EFM PFVKELagS EALTRH GVS gH KVDDSSG SlggS RRYARTDEIg (GVA FGVTIDFepDTIVNK TPHTAg30 


TLRDRDSM RQgpTRAEIS ELPS gIVQDLANG Niggf WADVEARYP og LFEG QETGKKgETIEE me 11~30% mmm 31~50% mmm >51% 
Extended Data Figure 1 | Hydrogen-deuterium exchange analysis to indicating overall structural opening. The regions having significant changes 
compare GlyRS“™ "?(P234KY) and GlyRS“” in solution. A global increase  (>10%) in deuterium incorporation are highlighted under the human cytosolic 
(15%) in deuterium incorporation for the mutant GlyRS was observed, GlyRS sequence with different colour codes (see box). 
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Extended Data Figure 2 | Characterization of the binding activity of 
GlyRS“™””. a, In vitro pull-down of GlyRS“™’?(P234KY) proteins with the 
ectodomains of Nrp1, TrkB, Dcc, Robol and Unc5C proteins. Note the much 
stronger binding of GlyRS““'”? with Nrp1 compared with other receptors. 
GlyRS was detected by immunoblot with anti-GlyRS antibody; similar amounts 
of input receptors were visualized by Coomassie blue staining. b, In vitro pull- 
down of GlyRS“™””” proteins with the ectodomain of Nrp1. In addition to 


Nrp1 


63. 


L129P and P234KY, direct binding to Nrp1 was detected for E71G and 
G240R GlyRS“™™”. c, GST pull-down to confirm that b1 domain of Nrp] is 
the main binding site of GlyRS““”?. The amount of GST and GST fusion 
proteins used for GlyRS“"”” binding was visualized by Ponceau staining. 

d, e, In vitro pull-down assay showing the mutual competition between 
GlyRS“™?P(L129P) and VEGF-Aj¢5 for Nrp1 binding. 
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Extended Data Figure 3 | Detection of GlyRS proteins in the cell medium. 
a, c, e, Western blot analysis of the GlyRS protein levels in NSC-34 motor 
neurons (a), C2C12 cell-differentiated myotubes (c) and undifferentiated 
C2C12 myoblasts (e). The observation that differentiated myotubes also secrete 
GlyRS raises the possibility that muscles, which are directly innervated by the 
peripheral motor neurons, might contribute to the disease pathology. The level 
of GlyRS proteins in cell medium is diminished by application of the exosome- 
pathway inhibitor GW4869, but not by brefeldin A (BFA), an inhibitor of the 
classical endoplasmic reticulum (ER)-to-Golgi secretory pathway. GAPDH 
(cytoplasmic protein), vWF (secretory protein through ER-Golgi pathway) 
and TSG101 (exosomal protein) are used as controls. b, d, Quantification of 


y 
32. 


——|—— 


GlyRS protein level indicated in a, c. Data are presented as the mean + s.e.m. of 
three independent experiments (*P < 0.05, t-test). f, Western blot analysis of 
the GlyRS protein level in NSC-34 motor neurons. The level of GlyRS proteins 
in the cell medium is increased by the treatment of monensin (MON), an 
activator for microvesicle release by regulating the intracellular calcium 
level*®*". Vehicle-treated cells were used as control (Ctrl). g, Western blot 
analysis of the GlyRS protein level in Cos7 cells transfected with plasmids 
encoding GlyRS“’” and GlyRS“7"?(P234KY). The expression of GlyRS 
proteins was detected by immunoblot with antibody to V5 epitope tag. GAPDH 
was used as control. Note the similar level of GlyRS“M’” and GlyRS™" in the 
media of transfected Cos7 cells. 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


: b Cell lysate Cell medium 
x 
a 
& Pa @ 
e < s 
Cell medium NSC-34 cells BS g ¢ 
ry S nd 
(kD) 


GlyRS 


53. 
TSG101 


Floating cells 


Dead cells 


Differential Centrifugation 


93. 
Bip 
Cell debris 68 
Mitochondria 
a _ =. 


Exosomes Supernatant 


VWF -__— e : 


Extended Data Figure 4 | Detection of GlyRS proteins in exosome-enriched supernatant fractions. The quality of the exosome preparation was controlled 
fractions. a, Diagram showing the procedure of exosome separation from the _ by detection of TSG101 (exosomal protein), Bip (ER-associated protein), 

cell medium of NSC-34 cells by differential centrifugation. See Methods for GAPDH (cytoplasmic protein), and vWF (secretory protein through 

details. b, Western blot analysis of proteins associated with various fractions. | ER-Golgi pathway). 

GlyRS proteins were detected in the exosome-enriched fractions but not in 
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Extended Data Figure 5 | CMT2D mutant embryos have overall normal 
morphology but exhibit facial motor neuron migration defects. a, Lateral 
view of wild-type and CMT2D mutant embryos at E12.5. Motor neurons are 
specifically labelled by a transgenic fluorescence reporter, Hb9:GFP (green). 
Note overall normal morphology of CMT2D mutant embryos (CMT) 
compared with their littermate controls (wild type). b, Western bolt analysis of 
protein expression in E12.5 mouse neural tissues. The expression levels of 
various neuronal proteins appear normal in CMT2D mutants compared with 


their littermate controls. c, Considering CMT2D mutants show varying degrees 
of morphological change of facial motor nucleus, the facial motor neuron 
migration phenotype is quantified by measuring the relative distance of the 
facial motor nucleus between wild-type and CMT littermate embryos (each dot 
represents one facial motor nucleus, n = 6 embryos for wild type; n = 8 
embryos for CMT2D). We find that the migration of facial motor neurons is 
significantly disrupted in CMT embryos. Data are presented as the 

mean + s.e.m. **P < 0.01 (ftest). 
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Extended Data Figure 6 | Genetic interaction between Gars and Nrp1 in 
the early stage of CMT2D. a, b, Hindlimb extension test of wild-type and 
mutant animals at 2 weeks. Note that two out of nine Gars™P:Nrp1 abe 
(CMT;Nrp1*’-) mutants exhibit hindlimb weakness with significantly lower 
scores compared with Gars“™"*P (CMT), Nrp1*’~ and wild-type littermate 
controls. c, Comparison of stride lengths in different CMT2D mutant mice at 
4 weeks old: GarsoM 17? (CMT), Gare? TrkBT (CMT;TrkB*“ ), 
Gars™™P.Dect/— (CMT;Dcc*’ ~), Gars™"? Robo1*t/— (CMT;Robo1 iat 
and Garo? UncsCt— (CMT; Unc5C*" ). No significant differences 
were observed between compound heterozygotes and their littermate 
controls (CMT). 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


Number 


Number 


iz) 


Number 


1 2 3 4 5 6 7 
Axon Diameter (um) 


CMT;Nrp1+/- 


Number a 
= nN ao 
o co co 
o o co o 


1 2 3 4 5 6 


“ 


Axon Diameter (um) 


Extended Data Figure 7 | Axonal dystrophy in CMT2D mice. 

a-d, Histograms showing the axonal diameter frequencies in the sciatic 
nerves of 4-week-old wild-type (a), Nrp1*’~ (b), GarsMEP (CMT; c), and 
CMT;Nrp1*’~ (d) mutant mice. n = 3 mice per group. Note the decreased 
numbers of larger-diameter axons in CMT;Nrp1*’ mutants compared with 
CMT, Nrp1 heterozygous, and wild-type controls. 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


LV-VEGF165-ires-GFP LV-GFP 


Extended Data Figure 8 | Expression level of VEGF in mouse muscles. The _ by immunostaining with anti-VEGF antibodies. Note that the expression 
expression level of VEGF proteins in muscle fibres of mice injected with level of VEGF in LV-VEGF-infected muscles is significantly higher than in 
lentivirus expressing LV-VEGF,¢5-IRES-GFP versus LV-GFP was determined | LV-GFP-infected control groups. 
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Extended Data Figure 9 | VEGF treatment retains limb strengthin CMT2D _ observed between both injected legs of wild-type animals in the hindlimb 
mice. a, Diagram showing that lentiviral vectors encoding GFP (LV-GFP) extension test. f, g, GDNF and VEGF-A). treatments fail to improve stride 
or VEGF-Aj6s (LV-VEGF\¢s5) are injected unilaterally into each hindlimb of length in CMT2D mice. Walking strides of 2-month-old CMT2D mice 

the same GlyRS™"”? mutant mouse at P5. c, e, At 5 weeks, LV-GEP-injected _ bilaterally injected with lentiviral vectors (LV) encoding GFP, GDNF or 


legs (L, left) of CMT2D animals have largely lost their ability to extend, VEGF-A,>;. No significant difference of hindlimb stride length was observed 
while LV-VEGF165-treated legs (R, right) retained more limb strength with between animals treated with LV-GDNF, LV-VEGF-Aj9;, and LV-GFP 
significantly higher scores in the hindlimb extension test (three out of seven controls. 


animals). P< 0.05 (permutation test). b, d, No significant difference was 
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Extended Data Figure 10 | A simplified model for the neomorphic binding 
activity of GlyRS“”. Left, GlyRS“” is a multifunctional protein with both 
intracellular and extracellular distributions. VEGF—Nrp1 signalling is an 
essential pathway for survival and function of motor neurons. (Note that VEGF 
may also act synergistically with other trophic factors, and/or maintain 
motor function indirectly by acting on Nrp1 receptors on non-motor neurons.) 
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Right, CMT2D mutations alter the conformation of GlyRS, enabling 
GlyRS“™’? to bind Nrp1. This aberrant interaction antagonizes the binding of 
VEGF to Nrp1, contributing to motor defects in CMT2D. Our results do 

not exclude the possibility that GlyRS“”””? may also interact with other 
extracellular and/or intracellular targets, related to CMT2D pathology. 


©2015 Macmillan Publishers Limited. All rights reserved 


Fs ss 


doi:10.1038/nature15382 


Yap-dependent reprogramming of Lgr5* stem cells 
drives intestinal regeneration and cancer 


Alex Gregorieff', Yu Liu’, Mohammad R. Inanlou', Yuliya Khomchuk' & Jeffrey L. Wrana!* 


The gut epithelium has remarkable self-renewal capacity that 
under homeostatic conditions is driven by Wnt signalling in 
Lgr5* intestinal stem cells (ISCs)'. However, the mechanisms 
underlying ISC regeneration after injury remain poorly under- 
stood. The Hippo signalling pathway mediates tissue growth and 
is important for regeneration”’. Here we demonstrate in mice that 
Yap, a downstream transcriptional effector of Hippo, is critical for 
recovery of intestinal epithelium after exposure to ionizing radi- 
ation. Yap transiently reprograms Lgr5* ISCs by suppressing Wnt 
signalling and excessive Paneth cell differentiation, while promot- 
ing cell survival and inducing a regenerative program that includes 
Egf pathway activation. Accordingly, growth of Yap-deficient 
organoids is rescued by the Egfr ligand epiregulin, and we find that 
non-cell-autonomous production of stromal epiregulin may com- 
pensate for Yap loss in vivo. Consistent with key roles for regen- 
erative signalling in tumorigenesis, we further demonstrate that 
Yap inactivation abolishes adenomas in the Apc™” mouse model 
of colon cancer, and that Yap-driven expansion of Apc ‘~ orga- 
noids requires the Egfr module of the Yap regenerative program. 
Finally, we show that in vivo Yap is required for progression of 
early Apc mutant tumour-initiating cells, suppresses their differ- 
entiation into Paneth cells, and induces a regenerative program 
and Egfr signalling. Our studies reveal that upon tissue injury, 
Yap reprograms Lgr5* ISCs by inhibiting the Wnt homeostatic 
program, while inducing a regenerative program that includes 
activation of Egfr signalling. Moreover, our findings reveal a key 
role for the Yap regenerative pathway in driving cancer initiation. 

To examine Yap function in intestinal epithelial cells we first ana- 
lysed Yap localization. This revealed cytoplasmic localization in most 
cells but absence in Paneth cells (Fig. 1a and Extended Data Fig. 1a). 
Upon exposure to whole-body irradiation (12 Gy), Yap was predomi- 
nantly cytoplasmic with certain cells displaying weak nuclear accu- 
mulation at 1 day post-irradiation (dpi), was mostly nuclear at 2 dpi, 
and by 4 dpi returned to a prominent cytoplasmic localization 
(Fig. 1a). Analysis of irradiated Yap“ mice revealed strongly reduced 
crypt proliferation at 3 dpi (Fig. 1b and Extended Data Fig. 1b), con- 
trasting with a previous study performed at 7 dpi that suggested 
Yap suppresses crypt regeneration*. However, we found that while 
6 dpi Yap*’“ crypts had completed regeneration, surviving Yap“ 
or Yap“4;Taz““ crypts were still recovering and thus appeared 
enlarged and hyperproliferative (Extended Data Fig. 1b, c). In mosaic 
Yap-deficient intestines at 6 dpi (see Methods), surviving wild-type 
and Yap-mutant crypts were similarly enlarged and indistinguishable 
(Extended Data Fig. 1d). To explore Yap requirement further, we 
also deleted Yap in Ler5* ISCs, which are essential for regeneration’, 
using tamoxifen-inducible Lgr5-creERT in a Rosa26-lacZ background 
(Yap“"8"*)*, Twenty-four hours after induction, Yap*'* mice 
were irradiated (12 Gy) (Fig. 1c) and the fate of Yap-deficient Ler5* 
ISCs was monitored by Yap and f-gal double immunostaining 
(Fig. 1d). In untreated controls, 88% of B-gal-expressing crypts (517 
of 587) lacked Yap, whereas in irradiated mice only 23% (91 of 394) 


lacked Yap (Fig. le). Thus, most post-irradiation B-gal* crypts 
arose from Lgr5* clones that escaped Yap deletion. In agreement, 
Yap“ crypts displayed increased apoptosis at 1 dpi, while prolifera- 
tion was unaffected (Extended Data Fig. le, f). Thus, Yap has an 
important early role in maintaining the ISC pool during intestinal 
regeneration, whereas the late hyperproliferative phase is independent 
of Yap and Taz. 

To understand Yap function in ISCs, we used ex vivo organoids’, 
which revealed severe impairment in crypt formation upon Yap dele- 
tion (Extended Data Fig. 2a, b). Similar to post-irradiated crypts 
in vivo, Yap was cytoplasmic in differentiated zones (Extended Data 
Fig. 2c), but in crypts was both cytoplasmic and nuclear (Extended 
Data Fig. 2c). Because organoids model aspects of crypt regeneration, 
we assessed Yap transcriptional programs (Methods and Extended 
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Figure 1 | Yap deficiency impairs crypt regeneration. a, Yap staining of 
intestines from untreated (i) and 12 Gy irradiated mice at indicated times (ii-iv) 
shows crypt base columnar, transit amplifying (filled arrowheads) and 
Paneth cells (open arrowheads). b, Quantification of number of surviving 
Ki67* crypts in Yap*/4 versus Yap“ mice at 3 dpi (10 Gy and 12 Gy). n> 12, 
n= scored sections (see Methods); error bars indicate s.e.m.; ***P < 0.0001. 
c, Design for tracing Yap-deficient Lgr5* ISCs. d, Yap and B-gal immuno- 
stainings from untreated and irradiated (12 Gy) Yap 8" mice (7 dpi). 

e, Survival of B-gal* and either Yap’, Yap , or mosaic Yap‘! crypts was 
quantified (see Methods). f, In situ hybridization (ISH) analysis of ISC 
marker Olfm4 in Yap*4 and Yap“ intestines (n = 3). Scale bars: a, 35 jum; 
d, f, 70 um. 
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Data Fig. 3a), including in a Yap overexpression model that does not 
alter homeostasis in vivo but impairs organoid morphogenesis 
(Extended Data Fig. 3a-c). This revealed that Yap repressed Wnt 
targets and ISC signatures (for example, Lgr5, Olfm4, Nkd1, Axin2, 
Ephb3 and Aqp4; see Supplementary Table 1), consistent with Yap and 
Taz inhibiting Wnt signalling (Extended Data Fig. 3d-f)**"° 
Accordingly, the ISC marker Olfm4 was downregulated at 1 dpi 
in vivo, as reported previously'!"’, but persisted in Yap“”“ crypts 
(Fig. 1f). Furthermore, while control crypts recovered Olfm4* cells 
by 2 dpi, in Yap“/4 mice Olfm4 expression sharply declined (Fig. 1f), 
indicating ISC loss. Thus, Yap paradoxically promotes long-term 
ISC maintenance by suppressing Wnt signalling during early crypt 
regeneration. 

We next asked how early inhibition of Wnt and Olfm4* ISCs by 
Yap ultimately drives regeneration. High Wnt activity sensitizes ISCs 
to DNA damage and p53-dependent apoptosis’, consistent with 
increased apoptosis observed in 1 dpi Yap“ crypts (Extended Data 
Fig. le). In addition, while Wnt promotes self-renewal of Ler5* ISCs, it 
also drives Paneth cell differentiation'*’’. Notably, Yap also repressed 
Paneth markers such as Wnt3, Spdef, Ccl6 and Kit (Extended Data 
Fig. 3d-f), and Yap-deficient crypts displayed marked cryptdin 1° 
Paneth cell expansion by 2 dpi, with only rare Olfm4* cells evident 
(Fig. 2a). Furthermore, while controls were strongly proliferative, with 
ectopic Paneth cells evident by 3 dpi, Yap“““ crypts were almost entirely 
comprised of Paneth cells, with few proliferating cells (Extended Data 
Fig. 4a). These results are consistent with Paneth defects in mice defi- 
cient for Hippo components Mst1 and Mst2 (ref. 16). To test directly 
whether elevated Wnt contributes to these phenotypes, we lowered 
Wnt signalling in Yap“ organoids by reducing the levels of the Wnt 
agonist Rspol (ref. 17). This rescued de novo crypt formation and 
normalized Paneth cell differentiation, despite reduced overall growth 
(Fig. 2b, c and Extended Data Fig. 4b-d). Collectively, these results 
indicate that in the absence of Yap, high Wnt signalling drives ectopic 
Paneth cell differentiation that together with elevated apoptosis con- 
spires to exhaust the ISC pool and block crypt regeneration. 
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Figure 2 | Yap prevents excessive differentiation into Paneth cells during 
regeneration. a, Fluorescence ISH on irradiated Yap‘ and Yap“/4 
intestines using Olfm4 and Paneth cell marker cryptdin 1 (n = 3). b, Yap*/4 


and Yap“ organoids cultured in reducing Rspol concentrations (as indicated) 


for 3 days (n = 4 independent cultures). c, Lysozyme stainings and Edu 
incorporation in representative organoids cultured at 150 and 38 ng ml"! 
Rspol. d, Fluorescence ISH to detect TweakR, Olfm4 and cryptdin 1 expression 
in irradiated (2 dpi; 12 Gy) Yap*/4 and Yap“ intestines (n = 3). Scale bars: 
a, c, d, 35 um; b, 70 um. 
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Nuclear accumulation of Yap further suggested that a Yap-driven 
program contributes to regeneration. We identified numerous Yap- 
induced genes such as TweakR (also called Tnfrsf12a), Areg, Edni1, Clu 
and JlIrn (Supplementary Table 1 and Extended Data Fig. 3d—f) that 
are implicated in tissue repair, inflammation and cancer’*””. In post- 
irradiated crypts, this program was strongly induced in a Yap-depend- 
ent manner, as well as in mice lacking Lats] and Lats2, which drives 
nuclear Yap (Extended Data Figs 5 and 6a). Furthermore, we observed 
Yap-dependent co-expression of TweakR and Edn1 in regenerating 
Olfm4” ISCs (Fig. 2d and Extended Data Fig. 6b), suggesting that 
Yap reprograms ISCs by transiently suppressing Wnt signalling, while 
concomitantly promoting the regenerative program. Egfr ligands 
(Areg, Hbegf and Ereg) were prominent in the Yap regenerative pro- 
gram and were of interest, because Egfr signalling is linked to ISC 
maintenance and Yap function’®**. Interestingly, treatment of 
Yap“ organoids with Egfr ligands showed that exogenous Ereg res- 
cued morphogenesis and suppressed apoptosis (Fig. 3a, b and 
Extended Data Fig. 7a), whereas additional Egf, a component of orga- 
noid growth medium, or Areg did not. The reasons for ligand specifi- 
city are unclear but may reflect non-redundant activities in vivo’. 
Notably, stromal Ereg protects against intestinal damage and drives 
colitis-associated cancer”””*. After irradiation, some Yap“ crypts 
survive and repopulate the gut, suggesting that stromal Ereg might 
rescue a handful of mutant crypts. Indeed, while Ereg was detected 
in certain cells of regenerating epithelia, it was also abundant in the 
underlying stroma (Extended Data Fig. 7b, c). Importantly, in Yap“ 
mice, stromal Ereg expression was even higher, while expression of 
other stromal growth factors was unchanged (Extended Data Fig. 7b, 
d). Thus, elevated stromal Ereg may compensate for Yap loss in some 
crypts that eventually regenerate the gut. Since reducing Wnt signal- 
ling also rescues Yap“ crypts in vitro, our results suggest that in vivo 
Yap coordinates Wnt antagonism and Egf signalling to synergistically 
mediate efficient crypt regeneration. 

To explore whether Yap regenerative responses contribute to can- 
cer, we examined intestinal tumorigenesis models driven by loss of the 
tumour suppressor Apc. In Apc” adenomas and Apc“ organoids, 
Yap displayed nuclear localization (Extended Data Fig. 8a and Fig. 3c) 
and Yap deletion virtually abolished polyp formation in Ape“ mice, 
markedly extending lifespans (Fig. 3d). Accordingly, Yap loss also 
blocked growth of Apc“ organoids and induced apoptosis (Fig. 3c, 
panel ii), while overexpression stimulated growth (Fig. 3e, panels i 
and ii). Thus, deregulated Wnt signalling caused by spontaneous 
Apc loss synergizes with Yap to drive tumorigenesis. We next exam- 
ined the effect of crypt hyperplasia on acute Apc loss driven by 
Vil-CreERT. In contrast to a recent report’®, deletion of Yap, Taz, or 
both Yap and Taz, had no effect on crypt hyperplasia, and Yap 
remained primarily cytoplasmic, indicating that Apc deletion in vivo 
is insufficient to drive nuclear localization of Yap (Extended Data 
Fig. 8b-d). In summary, Yap is required for adenoma formation but 
is dispensable for the proliferative burst associated with acute Apc loss. 

Our results suggest that Apc deletion short circuits Yap inhibition of 
Wnt signalling, thus synergizing with Yap to drive polyp formation. 
Indeed, Apc“ organoids displayed strong induction of Wnt targets 
that were only weakly affected by Yap loss, consistent with Yap 
antagonism upstream of Apc (Extended Data Fig. 9a)*°. However, 
Yap-induced genes that included Egfr agonists, and Yap suppression 
of Paneth markers, were unaffected by Apc loss, indicating that Yap also 
suppresses Paneth cells independent of its anti-Wnt effects (Extended 
Data Fig. 9a). Of note, Apc“’“ organoids do not require exogenous Egf 
(Fig. 3c), so we explored whether this is due to the Yap-induced Egf 
module. Egfr (PD153053) or Mek (U0126) inhibition abrogated growth 
of Apc’ spheroids and Yap-transgene-driven Apc’ spheroids 
(Fig. 3c, panel iv, and 3e, panels iii-vi). Furthermore, Yap loss dimin- 
ished, whereas upregulated Yap enhanced, phospho-Erk1/2 levels 
(Extended Data Fig. 9b), and ectopic Ereg rescued Yap*/4;Apc“/4 
spheroid growth (Extended Data Fig. 9c, d). The Egf module of 
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Figure 3 | Yap/Egfr signalling in intestinal 
regeneration and tumorigenesis. a, Yap*’4 and 
Yap~’4 organoids were supplemented with 

0.5 ug ml of recombinant Egf, Areg, or Ereg. 
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de novo crypts formed after 4 days of culture (n = 4 
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Yap regenerative signalling is thus an important target of the pro- 
tumorigenic effects of Yap. 

To examine how Yap-driven regenerative signalling contributes to 
Apc mutant adenoma formation, we generated Yap;Apc'8?* 
(Yap™";Ape™";Lgr5-creERT) mice, which allow timed induction of 
sporadic Yap-null and Apc-null ISCs that are tracked by staining 
Yap and the Wnt target gene Lef, respectively (Extended Data Fig. 
10a, b and Methods). This yielded both Yap wild-type and null Apc 
mutant, Lef* foci that were readily detected early, consistent with Yap 
and Taz being dispensable for expansion of acute Apc ‘~ lesions 
(Extended Data Fig. 10c, panels i and ii). Furthermore, Yap was cyto- 
plasmic in most, but not all, cells (Extended Data Fig. 10c, panel iii), 
further supporting that Apc loss is not sufficient to drive nuclear 
localization of Yap. However, at later stages, when Lef* lesions 
matured to morphologically distinct aberrant crypt foci and adeno- 
mas, Yap was predominantly nuclear (Extended Data Fig. 10c, panel 
vi), which coincided with expression of the Yap regenerative signature, 
including Egf pathway components (Fig. 3f). The rarity of adenomas in 
Yap;Ape*"8""" and Yap“/4;Apc™" mice (data not shown) raised the 
question of why Yap mutant Lef” foci undergo early demise. 
Apoptosis was not altered in Yap-deficient, Lef’ foci (Extended 
Data Fig. 10d), but Paneth cells were strongly enriched (Fig. 4a, b, 
panels i-iii, and Extended Data Fig. 10e, panels i-iii) with many lyso- 
zyme’, Yap mutant foci displaying weak Lef expression (Fig. 4b, 
panels iv-vi, and Extended Data Fig. 10e, panels iv—vi), indicating 
depletion into post-mitotic Paneth cells. Notably, mTOR inhibition 
also increases Paneth differentiation and reduces tumorigenesis in Apc 
models”. We also examined Egfr activity in these early lesions. 
Although weaker than in mature lesions, where the Yap regenerative 
program is robustly expressed, Egfr activation was readily detected in 
Yap wild-type lesions, but significantly reduced in Yap mutants 
(Fig. 4c and Extended Data Fig. 10f). Thus, Yap regenerative signalling 
probably promotes progression of Ape ’~ foci to adenomas by sup- 
pressing Paneth differentiation and activating Egf signalling. 

These studies define a central role for the Yap effector of the 
Hippo pathway in reprogramming ISCs during intestinal regeneration 
and driving tumorigenesis (Fig. 4d). Yap protects the ISC pool by 
transiently suppressing Wnt signalling and the Lgr5* ISC population, 
while blocking differentiation into Paneth cells and driving a pro- 
regenerative program critically dependent on induction of Egfr signal- 
ling. We showed that aberrant crypt morphogenesis in Yap-deficient 
organoids was overcome by either reducing Wnt signalling or by 


organoids grown for 5 days (i-ii). Proliferation and 
apoptosis were examined by Edu incorporation 
and active caspase 3, respectively. Yap expression 
is shown in panel iii. Treatment of Apc“ 
organoids with Egfr inhibitor (PD153035) mimics 
loss of Yap (iv) (n = 5). d, Tumour load in age- 
matched ApceM"", Yap*’* (n= 12), Yap*/4 

(nm = 19) and Yap’ (n = 19) mice (n = number 
of mice per group; error bars indicate s.e.m.; 
***P <0).0001). The right panel shows survival 
curves of Yap*/* (n = 9), Yap*/4 (n = 16) and 
Yap*/4 (n = 23) mice. e, YapTg:Apc“4 organoids 
were cultured 4 days with or without doxycycline 
and treated with inhibitors: PD153035 (Egfr) or 
U0126 (Mek) (n = 5). f, Expression of 1133, Cyr61, 
Msln, TweakR and phospho-Egfr (Tyr-1092) in 
Apc mutant aberrant crypt foci. Scale bars, 70 jm. 


addition of the Egfr ligand epiregulin. These two Yap-dependent 
processes may thus function in parallel and act synergistically in vivo 
to promote regeneration. Finally, we describe a mechanistic link 
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Figure 4 | Yap-dependent Egfr activation and suppression of Paneth cell 
differentiation in tumour initiating cells. a, Relative abundance of Paneth 
cells in Yap-positive (n = 207) and Yap-negative (n = 201) Lef* foci from 
tamoxifen-induced Yap;Apc“"*** mice (n = number of Lef* foci; error bars 
indicate s.e.m.; ***P < 0.0001, see Methods). b, Two sets of consecutive 
sections (i-iii and iv-vi) showing Lef, Yap and lysozyme (Lyz) staining of 
representative Lef* foci. c, Lef, Yap and phospho-Egfr (i-iv) staining of 
consecutive sections from Yap;Apc“"8"* mice. Graph shows quantification of 
expression levels of phospho-Egfr in Yap positive and negative Lef* foci (see 
Methods). Closed and open arrowheads indicate Yap positive and negative 
Lef* foci, respectively. d, A model of Yap function in intestinal regeneration 
and cancer. Scale bars, 70 um. 
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between tissue regeneration and cancer in which mutations in APC or 
B-catenin that short-circuit Yap suppression of Wnt* synergize with 
Yap regenerative signalling and Paneth cell suppression to create early 
tumorigenic lesions. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


No statistical methods were used to predetermine sample size. 

Mice. Yap“ and Taz“’“ mice were generated by crossing Yap or Taz floxed 
mice’ with the villin-cre line (Jackson Laboratory), the villin-creERT2 line 
(S. Robine, Institut Curie-CNRS) or the Lgr5-creERT line (Jackson Laboratory). 
The Rosa26-lox-STOP-lox-rtta-IRES-EGFP and Rosa26 lacZ mouse lines were 
obtained from Jackson Laboratory. The YapTg transgenic line described in this 
study was generated by introducing a HA-tagged wild-type Yap cDNA down- 
stream of 7 Tet-repressor elements in the pTRE2 vector (J. Whitsett, Cincinnati 
Children’s Hospital Medical Center). The transgenic Yap construct was linearized 
and microinjected in ICR embryos. As shown in Extended Data Fig. 3a, activation 
of Cre deletes a neo cassette and allows for expression of the rtTA gene. In the 
presence of doxycycline the rtTA activates transcription of HA-Yap. Apc floxed 
mice were obtained from O. Sansom (Beatson Institute). Lats] and Lats2 floxed 
alleles were obtained from R. Johnson (MD Anderson Cancer Center) and crossed 
with villin-creERT2 mice to obtain Lats1“/4;Lats2“4 mice. To measure polyp 
formation, Yap“ mice were backcrossed to a BI/6 background for 4 generations 
before crossing to Ape“”* mice. Polyps from Yap‘’* (Yap*/*;villin- 
creApeM’*), Yap*/4 (Yap™* ;villin-creApe™™*) and Yap4 (Yap™villin- 
cre;Apc™i"’*) mice were counted 16 weeks after birth or when animals appeared 
moribund. Survival of Apc" mice was measured by the number of days before 
mice were euthanized due to poor health. In vivo assays comparing control and 
Yap mutant animals were performed between age- and sex-matched pairs. No 
method of randomization was followed and no animals were excluded in this 
study. The investigators were not blinded to allocation during experiments and 
outcome assessment. Inducible Cre-mediated deletion of genes was performed by 
intraperitoneal injections of >5-week-old mice with 200 yl tamoxifen in corn oil 
at 10mgml7'. To create mosaic expression of Yap, Yap™";villin-creERT2 mice 
were induced with a single injection of 200 jl of tamoxifen at a suboptimal dose 
typically between 0.5 and 2.0 mg ml’. For in vivo regeneration assays, mice were 
given a single dose of 10 or 12 Gy using a GammaCell 40 irradiator. Animals were 
maintained and handled under procedures approved by the Canadian Council on 
Animal Care. 

Immunohistochemistry and in situ hybridization. The immunohistochemistry 
stainings and standard colorimetric in situ hybridization were carried out accord- 
ing to methods described elsewhere*’. Staining experiments were repeated on 
independent tissue sections prepared from separate mice as indicated by n values 
in figure legends. The following primary antibodies were used for immunostain- 
ing: rat anti-Ki67 (Dako, Cat. no. M7249, 1:1,000), rabbit anti-Yap/Taz (Cell 
Signaling, Cat. no. 8418, 1:100), rabbit anti-Yap (Cell Signaling, Cat. no. 14074, 
1:300), mouse anti-Yap (Santa Cruz, Cat. no. sc-101199, 1:100), rabbit anti-Lef 
(Cell Signaling, Cat. no. 2230, 1:300), phosphor-Egfr (Tyr1092) (Abcam, Cat. no. 
ab40815, 1:300), anti-cleaved caspase-3 (Cell Signaling, Cat. no. 9664, 1:300) and 
anti-lysozyme (Dako, Cat. no. A0099, 1:1,000). Detection of primary antibodies 
was achieved using the Dako Envision plus system. Multi-colour fluorescence in 
situ hybridization with tyramide signal amplification (TSA) was done essentially 
as described elsewhere**™*. In brief, RNA probes from hybridized sections were 
detected using appropriate hapten-specific HRP-conjugated antibodies (anti- 
digoxigenin-HRP (Roche, Cat. no. 11207733910, 1:500), anti-dinitrophenyl- 
HRP (PerkinElmer, Cat. no. NEL747A001KT, 1:300), and anti-fluorescein-HRP 
(Life Technologies, A21253, 1:500)). After overnight incubation with antibodies at 
4°C (or 2h at room temperature for anti-fluorescein-HRP detection of cryptin1) 
sections were washed in PBS, and rinsed twice in 100 mM borate pH 8.5 plus 0.1% 
BSA. TSA reaction was performed by applying 300 il per slide of the following 
mixture: 100 mM borate pH 8.5, 2% dextran sulfate, 0.1% Tween-20 and 0.003% 
H,0,, 450 pg ml! 4-iodophenol: 1:250 Tyramide product (that is, DyLight633- 
tyramide, Dylight488-tyramide, Dylight 555-tyramide). The TSA reaction was 
allowed to proceed for 20min and then terminated by washing slides in 
100 mM glycine pH 2.0 for 15 min. Sections were washed further in PBS for the 
next round of detection. To synthesize tyramide products, the following succini- 
midyl esters were used for conjugation with tyramine: DyLight 633 NHS-Ester 
(Thermo Scientific Cat#46414), DyLight 550 NHS-Ester (Thermo Scientific 
Cat#62262), DyLight 488 NHS-Ester (Thermo Scientific Cat. no. 46402). The 
synthesis reaction was carried out as described previously”. 

The following in situ hybridization probes were obtained from the collection 
of MGC clones at the Lunenfeld Tanenbaum Research Institute: TweakR 
(BC025860), Ly6c1 (BC092082), Edn (BC029547), Areg (BC009138), Ereg 
(BC027838), I1rn (BC042532), 1133 (BC003847), Msln (BC023753) and Cyr61 
(BC066019). The Olfm4 and cryptdin1 probes were a gift from H. Clevers 
(Hubrecht Institute). 

Immunofluorescence. Before fixing organoids, 101M Edu was added to the 
culture media for 1h. Then organoids were fixed in 10% buffered formalin for 
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30min, permeabilized in 0.5% Triton for 20min and blocked in 2% BSA. 
Incorporated Edu was detected using the ClickIt EDU Imaging kit (Invitrogen) 
according to the manufacturer’s instructions. The primary antibodies used for 
immunostaining were mouse anti-Yap (Santa Cruz, Cat # sc-101199, 1:100), 
mouse anti-HA (Sigma-Aldrich, Cat. no. H9658, 1:1,000), and chicken anti-B- 
gal (Abcam, Cat. no. ab9361, 1:300). The secondary antibodies used in immuno- 
staining were: CF555-donkey anti-mouse (Biotium, Cat. no. 20037, 1:400) and 
CF647 donkey anti-rabbit (Biotium, Cat. no. 20047, 1:400). Organoids were coun- 
terstained with 4’,6-diamidino-2-phenylindole dihydrochloride (DAPI) (Sigma- 
Aldrich) before mounting onto slides for visualization. Images were acquired 
using a 20X/NA oil immersion objective lens (HCX PL APO, Leica), an EM- 
CCD camera (ImagEM, Hamamatsu) on an inverted microscope (DMIRE2, 
Leica) with a spinning disk confocal scanner (CSU10, Yokogawa) and Volocity. 
Quantification of de novo crypt formation. De novo crypts were scored as any 
protrusions, typically containing Paneth cells, budding from the initial sphere 
formed after seeding isolated crypts. Crypts were counted from bright-field images 
using Image J. At least four independent cultures derived from four different mice 
per genotype were used for quantification. 

In vivo quantification. Survival of crypts in Fig. 1 was determined by Ki67 
staining of cross-sections of proximal portions of the small intestine at 3 days 
post-irradiation (10 Gy or 12 Gy). Values in Fig. 1b represent average number 
of fully labelled Ki67~ crypts per intestinal circumference based on counts from at 
least two sections per mouse and assays were repeated in 6 independent mice per 
genotype for both 10 Gy and 12 Gy treatments. The percentage of surviving Yap- 
positive versus negative Lgr5* ISCs in Fig. 1d was performed by counting 587 
B-gal* crypts from a total of 7 untreated Yap“/8"°* mice and 394 B-gal* crypts 
from a total of 9 irradiated Yap*'8*°* mice. In Yap;Apc“'8"* mice tumour 
initiating cells were visualized by staining for the Wnt target gene, Lef. As shown 
in Extended Data Fig. 10b, Lef is undetected in wild-type crypts and highly upre- 
gulated in Apc-null cells and thus serves as a robust marker of Apc deletion*’. The 
percentage of Paneth cells in Lef* foci (Fig. 4a) was assessed by preparing con- 
secutive sections stained for Lyz, Lef and Yap, respectively. Lysozyme-positive 
Paneth cells from a total of 207 Yap wild-type and 201 Yap mutant Lef* foci were 
counted from 5 Yap;Apc"8*"* mice (10-16 days after tamoxifen injection) using 
Image J and the percentage of total cells within the boundaries of a given Lef* 
lesion was calculated. Relative activation of Egfr was quantified in consecutive 
sections from 5 Yap;Apc“"8">"* mice stained for Lef, Yap and phospho-Egfr. For 
assessing Phospho-Egfr, staining intensity in Lef* foci was assessed in a blinded 
fashion. For this, consecutive sections stained for Yap were masked from the 
observer scoring phospho-Eefr staining intensity. Lef* foci were scored as ‘+’ if 
phospho-Egfr expression was elevated compared to wild-type adjacent crypts at 
comparable levels within the crypt-villus axis (see Extended Data Fig. 10f, panels 
xiand xii). Lef* foci were scored as ‘+ +’ if staining intensity was very strong even 
relative to the stem cell compartment in normal crypts and/or displayed prom- 
inent apical staining (see Yap-positive foci in Fig. 4c, panel iv, and Extended Data 
Fig. 10f, panels v and vi). Lef* foci were scored as ‘~’ if staining intensity was 
undetected or unchanged relative to adjacent wild-type crypts (see Yap mutant 
foci in Fig. 4c and Extended Data Fig. 10f). In Extended Data Fig. 1, caspase 3 and 
BrdU positive crypt cells were counted from at least six sections per mouse in 4 
independent mice per genotype and expressed as a percentage of total crypt cells. 
Statistics. All data are presented as average values with s.e.m. Mann-Whitney 
(two-tailed) U-test was used to determine statistical significance. Calculations 
were performed using GraphPad Prism 5 software. 

RNA-seq analysis. RNA was isolated from organoids cultured for 24 h after 
seeding in Matrigel. RNA samples were pooled from at least three organoid cul- 
tures derived from at least three independent mice per genotype (Yap"* ;villin-cre, 
Yap™";villin-cre and YapTg). Quality of RNA was verified by running samples ona 
Bioanalyzer. High-throughput sequencing was performed using the Illumina 
HiSeq 2000 at the Lunenfeld Tanenbaum Research Institute (LTRI) sequencing 
facility. Raw sequencing reads in Fastq formats were mapped onto mouse genome 
(mm{9) using Tophat 1.4.1 and the RPKMs (reads per kilobase of exon model per 
million mapped reads) were calculated using a customized script. RNA-seq data 
are presented in Supplementary Table 1. Combined fold change presented in 
Extended Data Fig. 3d and Supplementary Table 1 was calculated using the fol- 
lowing formula: combination fold change = log, (Yap 4) Yap*/)/ (Dox+/Dox—)]. 
R, Cluster 3.0 and Java TreeView were used for data visualization. 

Organoids. Gut organoids were cultured according to a previously described 
protocol established by Sato and Clevers’. Briefly, crypts were harvested by incub- 
ating opened small intestines in PBS containing 2 mM EDTA. The epithelium was 
released by vigorous shaking and crypts separated using a 701m cell strainer. 
Crypts were seeded in growth factor reduced Matrigel (BD Biosciences) and grown 
in Advanced DMEM/F12 (Invitrogen) supplemented with 2mM GlutaMax 
(Invitrogen), 100 Uml ? Penicillin/100 ig ml Streptomycin (Invitrogen), N2 
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Supplement (Invitrogen), B-27 Supplement (Invitrogen Cat), mouse recombinant 
Egf (R&D Systems), 100ngml’ mouse recombinant Noggin (Peprotech), 
150ngml-' human Rspl (R&D Systems). Apc-deficient organoids were 
harvested from Yap” * Ape" villin-creERT, Yap"! Ape villin-creERT or 
YapTg;Ape" S-yillin-creERT mice injected with tamoxifen and seeded 48 h later 
in basal growth medium without Egf, Rsp1 or Noggin. To induce Yap expression 
in YapTg organoids, 1.5 pg ml * doxycycline was added to the culture medium 
on day 0. Egf (R&D Systems, Cat. no. AF2028), Areg (R&D Systems, Cat. no. 
AF989) and Ereg (R&D Systems, Cat. no. 1068-EP-050) were added to the culture 
medium at a final concentration of 0.5 pg ml '. The following inhibitors were 
used: PD153053 (0.5 1M, Tocris Bioscience), U0126 (10 4M, Merck Millipore). 
To examine pErk1/2 levels, organoids were harvested at day 2 in cold PBS con- 
taining 5mM EDTA, 1mM NaVO,, 1.5mMNaF and protease inhibitors. 
Organoids were incubated at 4°C for 30 min to dissolve Matrigel and then lysed 
in TNTE buffer (50mM Tris/HCl pH 7.6, 150mM NaCl, 0.5% Triton X-100, 
1mM EDTA) containing standard protease and phosphatase inhibitors. Protein 


concentrations were measured and samples were subjected to SDS-PAGE. Total 
RNA was extracted by removing culture medium and directly lysing organoids in 
wells using RTL buffer of the Rneasy Mini Kit (Qiagen). RNA was purified using 
columns and genomic DNA was removed by treatment with RNase-Free 
DNase (Qiagen). 
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Extended Data Figure 1 | Analysis of late regenerative responses in Yap- 
deficient crypts after irradiation. a, Staining of untreated Yap*/4 and Yap“ 
intestines with Yap/Taz antibodies. Filled arrowheads point to crypt base 
columnar cell (CBC) stem cells and open arrowheads indicate Paneth cells. 

b, Comparison of crypt proliferation in Yap’ and Yap“ mice at 3, 4 and 
6 dpi (10 Gy) by staining representative sections with Ki67 antibodies. Sections 
in bottom panels were immunostained with anti- -Yap/Taz antibodies to 
confirm the absence in Yap and Taz expression in Yap“ crypts at 6 dpi. 

c, Stainings of control Yap*/4;Taz*/4 versus Yap“’4;Taz4 mice at 6 dpi 

(10 Gy) with Ki67 and Yap/Taz antibodies. d, Mosaic analysis of Yap in the late 
regenerative response. Pairs of consecutive sections from mice displaying 
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mosaic intestinal expression of Yap at 6 days post-irradiation (10 Gy) were 
stained for Yap (left panel) and Ki67 (right panel). Open arrowheads in 
consecutive sections represent Yap-null crypts and filled arrowheads point to 
Yap-positive crypts. Images in b and c are representative of at least three 
stainings performed on tissues derived from se eparate mice. e, f, Analysis of 
apoptotic and mitotic cells in Yap*/4 and Yap~”“ intestines at 1 dpi (12 Gy). 
Representative stainings of anti-active caspase 3 and BrdU incorporation are 
shown to the left. Bar graphs represent percentage of caspase 3* cells and 
BrdU" cells within the crypt epithelium scored from at least 4 individual mice 
per genotype. Error bars indicate s.e.m.; 1 = 30 (n represents the total number 
of sections scored per genotype); ***P < 0.0001. Scale bars, 70 jum. 
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Extended Data Figure 2 | Yap localization and function in organoid c, Proliferation and apoptosis in Yap*/4 and Yap“ organoids grown for 3 days 
cultures. a,b, Crypts from Yap’ and Yap“ mice were harvested and were evaluated by examining incorporation of Edu (red) and active caspase 
cultured under standard conditions (see Methods) for the indicated times. 3 (yellow), respectively. Endogenous Yap expression is shown in green. Panels 


Panel b depicts the percentage of organoids showing 0, 1, 2,3 or 24 de novo __ iiiand iii’ show cytoplasmic localization of Yap. Panels iv and iv’ show nuclear 
crypts at 4 days (error bars indicate s.e.m.; n = 7 (n represents the number of — accumulation of Yap in forming crypts. Arrowheads indicate increased 
separate cultures per genotype per mouse; ***P = 0.0006, **P < 0.0021)). apoptotic cells in Yap“ organoids. Scale bars, 70 jm. 
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Extended Data Figure 3 | Identification of Yap regulated genes. 

a, Schematic representation of YapTg mice. Induction of HA-Yap was achieved 
by intercrossing villin-cre or villin-creERT2 mice with Rosa26-lox-STOP-lox- 
rtta-IRES-EGFP mice (see Methods). b, Analysis of HA-Yap protein 
expression in the intestinal epithelium. Intestinal crypts were isolated, lysed and 
subjected to SDS-PAGE (left panel). Expression of HA-Yap is only detected 
in transgenic mice in the presence doxycycline. Immunohistochemistry 
staining using anti-HA and Ki67 antibodies in untreated small intestine of 
YapTg mice (right panel). c, Analysis of Yap overexpression in organoid 
cultures. Crypts from YapTg intestines were seeded and induced with 
doxycycline. Representative organoids cultured for 3 days are shown as bright- 
field images or Edu (red) and caspase 3 (yellow) stainings. Yap transgene 
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expression was detected by anti-HA staining (green) (n = 5). Arrowhead 
indicates diminished Edu incorporation in doxycycline-induced organoids. 
Scale bars, 70 jum. d, Identification of relative expression of Yap-regulated genes 
by RNA-seq anal as are shown as rank order plots comparing control and 
Yap*/4 and Yap*’“ organoids isolated at day 1, as well as doxycycline 
treated and untreated YapTg organoids at day 1 of culture: combined fold 
change = log, [(Yap“’4/Yap'“)/YapTg(Dox+/Dox-)]. e, £, q2CR \ analysis of 
selected Yap-regulated genes comparing fold change between Yap*/4 and 
Yap“’4 or doxycycline-treated and untreated YapTg organoids at day ‘ 
respectively. Error bars indicate s.e.m.; n> 3 (n represents the number of 
independent organoid cultures per genotype per mouse analysed for 
each gene). 
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Extended Data Figure 4 | Lowering Rspol levels rescues growth of Yap 
mutant organoids. a, Fluorescence immunostaining of Yap’ and Yap“4 
mice at 3 dpito detect Ki67~ cells (yellow) and lysozyme positive (Lyz*) Paneth 
cells (red). b, Representative images of Yap ‘’ 4 (i, iii, v) and Yap* 4 (ii, iv, vi) 
organoids cultured in reducing Rspol concentrations for 3 days. Scale bars: 

a, 35 um and b, 70 pm. c, Quantification of the percentage of organoids 
displaying 0, 1, 2, 3 or =4 de novo crypts after 3 days in culture using indicated 


concentrations of Rspol. Error bars indicate s.e.m.; n = 4 (n represents 
number of organoid cultures analysed per genotype per mouse). d, Relative 
expression of Paneth cell markers (Kit, Lyz, Wnt3, Defa5) was evaluated by 
qPCR. Graphs show representative results of 3 independent organoid cultures 
per genotype. Error bars represent minimal and maximal range in fold 
differences derived from the standard deviation in Ct values; n = 3 

(n represents the number of technical repeats). 
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Extended Data Figure 5 | Yap-dependent expression of pro-regenerative TweakR (i-iv), Ly6c1 (v-viii), Il1rn (ix—xii), Areg (xiii-xvi) and Clu (xvii-xx). 
genes in the intestinal epithelium after irradiation. ISH on untreated and _ Images are representative of at least two stainings per gene performed on tissues 
irradiated (2 dpi, 12 Gy) Yap*/4 and Yap“ intestines using specific probes for _ derived from separate mice. Scale bars, 70 jm. 
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Lats1“4;Lats2“4 
Extended Data Figure 6 | The Yap-dependent regenerative program is Yap““ intestines (n = 3). Edn expression is detected in ISCs post-irradiation 
activated in Lgr5* ISCs. a, ISH showing induction of Areg (i-ii), Ereg (iii-iv), in a Yap-dependent manner. Arrowheads point to location of Olfm4* ISCs. 
MslIn (v-vi), 1133 (vii-viii), Clu (ix-x) and Yap protein expression (xi-xii) in Images are representative of three stainings per gene performed on tissues 
intestinal epithelium of Lats”/4;Lats2“/4 mice. b, Fluorescence ISH to detect derived from separate mice. Scale bars: a, 70 jim; b, 35 um. 
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Extended Data Figure 7 | Stimulation of Yap-independent growth via 
stromally derived Ereg expression. a, Yap*/4 and Yap““ organoids were 
grown for 3 days in standard growth media and supplemented with 0.5 jg ml 
of recombinant Ereg. Panels in the top row show Edu incorporation and 
active caspase 3 stainings in onganoid cultures. Proliferation and apoptosis 
status in Ereg-stimulated Yap“”“ organoids are comparable to control 
organoids. Arrowhead points to apoptotic cells in Yap-deficient organoids. 
Panels in the bottom row show endogenous Yap expression and confirm that 
Ereg-stimulated Yap“ organoids are Yap deficient. Green signal in Yap4/4 
organoids is non-specific staining of cellular debris in lumen. b, ISH to 
monitor Ereg expression in untreated and irradiated (2 dpi, 12 Gy) Yap*/4 and 
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Yap" mice. In iii and vi, dotted lines demarcate crypt boundaries. c, Ereg was 
detected by fluorescence ISH (red) and epithelia highlighted by counterstaining 
for B-catenin protein (green). Open arrowheads point to examples of Ereg 
expression in the epithelium, and filled arrowheads indicate stromal cells. Note 
Ereg expression in certain cells of the regenerating epithelium. All images are 
representative of three stainings performed on tissues derived from separate 
mice. Scale bars (a-c), 70 um. d, qPCR analysis of stromally derived factors 
Ereg, Sfrp1, Bmp4, Wnt2b, Wnt4, WntSa from samples of whole intestines 

of irradiated (2 dpi; 10 Gy) Yap“ and Yap“ mice. Error bars indicate s.e.m.; 
n=7 (n represents number of independent mice analysed per genotype). 
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Extended Data Figure 8 | Role of Yap in Apc mutant cells. a, Apc””* 
adenomas display high levels of nuclear Yap. b, Comparison of crypt 
proliferation in Yap*4;Taz*/4;Ape“/4 and Yap/4;Taz4;Apc4 mice at 2 
and 4 days after tamoxifen injection by staining representative sections with 
Ki67 antibodies (panels i-iv). Sections in panels v and vi were immunostained 
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with anti-Yap/Taz antibodies. Images are representative of three stainings 
performed on tissues derived from separate mice. Scale bars, 70 jum. 

c, d, Genotyping of tail and crypt DNA and western blot analysis from indicated 
mice at 4 days after tamoxifen injection to confirm deletion of Yap and Taz. 
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Extended Data Figure 9 | Role of Yap- and Egfr-dependent signalling in 
Apc mutant cells. a, qPCR analysis of selected Yap regulated genes comparing 
fold change between Apc mutant Yap’ and Yap“ organoids. Error bars 
indicate s.e.m.; n > 4 (n represents the number of independent organoid 
cultures per genotype per mouse analysed for each gene). b, Western blot 
analysis showing expression of pErk1/2 in Yap*4;Apc“/4 (het), Yap“/“;Apce“4 
(KO) and doxycycline-treated YapTgApe“ organoids (Tg). Effects of 
PD153035 and U0126 treatment on pErk1/2 levels are also shown (n = 3; 

n represents the number of separate organoid cultures analysed per genotype). 
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c,d, Yap*/4;Apc™4 and Yap“;Apc“/4 mice were injected with tamoxifen and 
crypts harvested 48 h later. Organoids were grown in medium lacking Rsp1, 
Noggin and Egf and supplemented with 0.5 pg ml! of recombinant Ereg. 
Immunofluorescence shows that Ereg treatment strongly enhances Edu 
incorporation in both control and Yap“’4;Apc“ (middle panels). Yap 
stainings (right panels) show that Ereg-stimulated Yap““;Apc*“ organoids 
lack endogenous Yap. Images are representative of at least three cultures 
derived from individual mice. Scale bars, 70 Lm. 
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Extended Data Figure 10 | Role of Yap in Apc mutant tumour initiating 
cells. a, Experimental procedure to target Yap in Apc mutant Lgr5~ ISCs. 

b, Tracing of Apc” cells by staining for the Wnt target gene Lef at 4 

and 30 days after tamoxifen injection (dpi) of Yap;Apc*’8""“" mice. Note 

Lef staining is absent in surrounding wild-type crypts. c, Consecutive sections 
from Yap;Apc*"8">* mice at 10 dpi demonstrate both Yap negative (open 
arrowheads) and Yap positive (filled arrowheads) Lef* foci (panels i-iii). 
Arrowheads in panel iii indicate occasional nuclear Yap staining in early Lef* 
foci. In Lef* foci displaying aberrant crypt morphology, Yap is strongly 


nuclear (panels iv-vi). d, Consecutive sections from Yap;Apce*"8"** mice at 10 
dpi showing the levels of caspase 3* apoptotic cells in both Yap-positive (filled 
arrowheads, panels i-iv) and Yap-negative (open arrowheads, panels v-viii) 
Lef* foci. e, Two sets of consecutive sections (i-iii and iv-vi) showing Lef, Yap 
and Lyz staining of representative Lef” foci. f, Two sets of consecutive sections 
(i-vi and vii-xii) showing Lef, Yap and phospho-Egfr staining of tamoxifen- 
induced Yap;Ape“'"°“" mice. Filled and open arrowheads indicate 
Yap-positive and -negative Lef* foci, respectively and panels ii, iv, vi, viii, x 
and xii are enlargements of adjacent panels. Scale bars, 70 jum. 
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Enterococcus faecalis is both a common commensal of the human 
gastrointestinal tract and a leading cause of hospital-acquired 
infections’. Systemic infections with multidrug-resistant enterococci 
occur subsequent to gastrointestinal colonization’. Preventing col- 
onization by multidrug-resistant E. faecalis could therefore be a 
valuable approach towards limiting infection. However, little is 
known about the mechanisms E. faecalis uses to colonize and com- 
pete for stable gastrointestinal niches. Pheromone-responsive con- 
jugative plasmids encoding bacteriocins are common among 
enterococcal strains’ and could modulate niche competition among 
enterococci or between enterococci and the intestinal microbiota. 
We developed a model of colonization of the mouse gut with 
E. faecalis, without disrupting the microbiota, to evaluate the role 
of the conjugative plasmid pPD1 expressing bacteriocin 21 (ref. 4) in 
enterococcal colonization. Here we show that E. faecalis harbouring 
pPD1 replaces indigenous enterococci and outcompetes E. faecalis 
lacking pPD1. Furthermore, in the intestine, pPD1 is transferred to 
other E. faecalis strains by conjugation, enhancing their survival. 
Colonization with an E. faecalis strain carrying a conjugation- 
defective pPD1 mutant subsequently resulted in clearance of vanco- 
mycin-resistant enterococci, without plasmid transfer. Therefore, 
bacteriocin expression by commensal bacteria can influence niche 
competition in the gastrointestinal tract, and bacteriocins, delivered 
by commensals that occupy a precise intestinal bacterial niche, may 
be an effective therapeutic approach to specifically eliminate intest- 
inal colonization by multidrug-resistant bacteria, without profound 
disruption of the indigenous microbiota. 

The mammalian host is colonized by 10-100 trillion microbes that 
live in a predominantly symbiotic relationship with their host**. 
Invasion by some of these symbionts can cause serious disease when 
homeostasis is disrupted’. One such symbiont is E. faecalis, a Gram- 
positive member of the gut microbiota of a wide range of mammals, 
including humans’. Although enterococci are usually not pathogenic, 
they can cause significant disease in immune-compromised indivi- 
duals. Treating enterococcal infections is challenging owing to their 
intrinsic and acquired resistance to a wide range of antibiotics””°. Prior 
antibiotic therapy is a well-known risk factor for enterococcal infec- 
tion”. Antibiotic disruption of the intestinal biota enables resistant 
enterococci to proliferate profusely in the gastrointestinal tract’? and 
invade the host'’. Reducing colonization of the gastrointestinal tract by 
antibiotic-resistant enterococci could therefore represent a promising 
approach for preventing enterococcal infections. However, our mech- 
anistic understanding of enterococcal gastrointestinal colonization is 
limited. Most previous studies employed antibiotic disruption of the 
intestinal microbiota before enterococcal challenge, resulting in inva- 
sion rather than colonization. Our model establishes long-term col- 
onization of mice with a marked strain of E. faecalis without antibiotic 


pretreatment, allowing us to study E. faecalis dynamics in an unper- 
turbed intestinal environment. 

We established long-term colonization of FVB mice by prolonged 
feeding with a rifampicin-resistant E. faecalis strain (EF,), CK135. 
After withdrawal of EF, from the drinking water, mice maintained 
persistent colonization of the gastrointestinal tract and shedding of 
EF, in faeces for more than eleven weeks (data shown for first 
four weeks, Fig. 1a, b). In contrast, mice gavaged with ~10” colony- 
forming units (CFU) of EF, demonstrated short-lived colonization 
(2-3 days), with significant mouse-to-mouse variability. Although 
the specific factors that determine bacterial gastrointestinal coloniza- 
tion fitness are unclear, the success of our model depends on both the 
nature and the route of delivery and offers a means of investigating the 
underlying mechanisms. Similar levels and patterns of EF, coloniza- 
tion were observed in C57Bl/6 mice and with multiple lineages of 
E. faecalis (Extended Data Fig. 1a, b), suggesting that this approach 
for mouse gastrointestinal colonization by E. faecalis is generalizable. 

We sought to understand the effect of bacteriocins—antimicrobial 
peptides often encoded on pheromone-responsive plasmids—on 
enterococcal colonization dynamics of the gastrointestinal tract. 
Bacteriocin production is a tool bacteria can use to enhance the 
stability of their communities’’, by outcompeting closely related bac- 
terial species and establishing a stable niche for the producing 
strain'*!’. Many lactic acid bacteria produce bacteriocins'**. The 
enterococcal bacteriocin, bacteriocin 21 (bac-21), is encoded on the 
sex-pheromone-responsive conjugative plasmid pPD1 (ref. 19) and is 
identical in nucleotide sequence to another well-characterized enter- 
ococcal bacteriocin, AS-48 (ref. 20) (Supplementary Table 1a, b). The 
prevalence of pPD1, first isolated from an enterococcal strain of 
human gingival origin, has not been described. Only the phero- 
mone-responsive and bacteriocin operons (bac) of pPD1 have prev- 
iously been sequenced*’’. Complete sequencing and annotation of 
pPD1 revealed a 57,732-base-pair (bp) plasmid with a total of 59 open 
reading frames (ORFs) (Extended Data Fig. 2 and Supplementary 
Table 2). The pPD1 bac operon contains nine genes, designated from 
bacA to baclI’’, of which bacA encodes a 105-amino-acid precursor of 
bac-21 (ref. 19). The genes bacB to bacl are thought to be involved in 
processing, modification and secretion of the mature 70-amino-acid 
bacteriocin and providing immunity to the bacteriocin producer’. 

Colonization experiments with an E. faecalis strain harbouring 
pPD1 (EF, + pPD1) revealed that EF, + pPD1 was significantly more 
abundant than EF, in faeces and throughout the gastrointestinal 
tract, suggesting more effective colonization (Fig. la, b). To test 
whether pPD1-encoded bac-21 drives enhanced enterococcal coloni- 
zation, we introduced an in-frame deletion of bacAB into pPD1 
(EF, + pPD1::AbacAB). EF, + pPD1::AbacAB did not exhibit a growth 
defect in vitro and lacked bacteriocin activity (Extended Data Fig. 3a). 
Mice were colonized with EF, + pPD1, EF, + pPD1::AbacAB or EF,. 
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Figure 1 | pPD1 enhances E. faecalis competition for an intestinal niche. 
a, b, Mice were colonized by EF, (n = 5) or EF, + pPD1 (n = 6), which were 
enumerated weekly from faeces (a) and at week 4 from each segment of the 
gastrointestinal tract (b) (distal small intestine (DSI), caecum and large 
intestine (LI)). c, d, Mice were colonized with EF, + pPD1::AbacAB (n = 5), 
EF, (1 = 3) or EF, + pPD1 (n = 3). Faecal abundance of E. faecalis was 
determined weekly (c) and in the gastrointestinal tract at week 4 (d). e, Mice 
(n = 5 per group) were colonized with EF,, EF, + pPD1, EF, + pPD1::AbacAB 
or sterile drinking water; at week 4, abundance of total enterococci 
(indigenous and laboratory strains) and rifampicin-resistant laboratory strains 
of E. faecalis was enumerated in each segment of the gastrointestinal tract. 

f, Microbiome analysis. Ordination of E. faecalis (n = 5 mice) and EF, + pPD1 
(n = 5 mice) samples was separated by the Bray—Curtis beta diversity metric 
(Adonis P = 0.007). Samples are connected to help visualize grouping. An 
exponential decay model was fitted to the data ina and. Ina, the rate of decay 
is significantly different between the two groups (P < 0.0001) and in 

c between EF, + pPD1::AbacAB and EF, (P < 0.0001), as well as between 

EF, + pPD1::AbacAB and EF, + pPD1 (P< 0.0001). Horizontal lines in 

b, d and e indicate geometric means. Each symbol represents an individual 
animal; data are representative of six (a, b) and two (e) biologically independent 
experiments. Data are from one experiment in c, d and f. Black dotted 

lines (all panels) indicate the limit of detection at 100 CFU per g faeces. BHI, 
brain-heart infusion medium; Ent, Enterococcus agar; NMDS, non-metric 
multidimensional scaling; ns, not significant; Rf, rifampicin. 


EF, + pPD1::AbacAB showed no colonization advantage over EF, and 
demonstrated impaired long-term persistence in the gastrointestinal 
tract compared to EF, + pPD1 and EF, (Fig. 1c, d), possibly owing to 
the excessive burden of maintaining a large non-functional plasmid. 
Alternatively, it is possible that unknown colonization factors expressed 
in the presence of an intact pPD1 are affected by bacAB deletion, result- 
ing in loss of colonization ability. 

Mice obtained from commercial vendors possess a variable indigen- 
ous intestinal enterococcal population (Fig. le and Extended Data 
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Fig. 3b-e). To determine whether exogenous colonization with EF, 
strains replaced the indigenous enterococci to establish a stable niche 
in the gut, the indigenous enterococcal population was monitored after 
colonization with EF, or EF, + pPD1 (Extended Data Fig. 3b, c). EF, 
accounted for approximately 5-10% of the total enterococci in the 
faeces and declined over time to 0.1-1% (Fig. le and Extended Data 
Fig. 3b-e). However, EF, + pPD1 rapidly dominated the niche within 
one week and maintained this dominance over time (Extended 
Data Fig. 3b, c), suggesting that it was displacing or outcompeting 
the majority of the indigenous enterococci in the mouse gut. 
EF, + pPD1::AbacAB could not compete in the gut as it failed to 
displace the indigenous enterococci (Fig. le and Extended Data 
Fig. 3b-e). Moreover, colonization of the gut with EF, carrying another 
pheromone-inducible conjugative plasmid that lacks bacteriocins 
(pCF10) did not result in domination of the intestinal niche 
(Extended Data Fig. 4). Hence, bac-21 is likely to be responsible for 
the colonization advantage conferred by pPD1. 

Although this result suggests that EF, + pPD1 directly kills com- 
peting enterococci in the gut, bac-21 might also alter physical or nutri- 
tional niches by killing other competing bacterial species*’’. To 
investigate this, high-throughput 16S ribosomal DNA (rDNA) pyro- 
sequencing was used to assess the composition of the gut community 
in colonized mice. We found no significant differences in composition 
between the caeca of control mice and those colonized by EF,, indi- 
cating that colonization of mice with laboratory strains of E. faecalis 
does not substantially alter intestinal microbial ecology (Extended 
Data Fig. 5a). The caecal composition of EF, and EF, + pPD1 were 
significantly different (P = 0.007) and clustered separately (Fig. 1f). 
However, only a member of the Gram-negative Deferribacteraceae, 
Mucispirillum, was significantly different between groups (Extended 
Data Fig. 5b). We suggest that this may be a secondary effect, as 
Gram-positive bacteriocins such as bac-21 have low efficacy against 
Gram-negative bacteria. We conclude that elimination of competing 
enterococcal strains is the primary mechanism by which EF, + pPD1 
enhances its colonization. 

To further investigate bac-21-mediated enterococcal competition in 
the gut, a competitive colonization experiment using differentially 
marked E. faecalis strains was performed. Combinations of EF, and 
plasmid-carrying spectinomycin-resistant E. faecalis (EF, + pPD1) or 
EF, and EF, + pPD1 with an in-frame deletion (EF, + pPD1::AbacAB) 
were given to groups of mice, at indicated ratios (Fig. 2). At week 0, 
faecal shedding of EF,, EF,+pPD1 and EF, + pPD1::AbacAB 
reflected the ratio of the bacteria given in the drinking water (Fig. 2). 
When EF, + pPD1 was given in excess, it rapidly outcompeted EF, 
(Fig. 2g). When EF, was given in excess (Fig. 2a) or equal abundance to 
EF, + pPD1 (Fig. 2d), it did not outcompete EF, + pPD1, despite 
being able to persist at lower levels. Subsequent experiments indicated 
that the ability of EF, to persist was due to conjugative transfer of pPD1 
to the EF, host, thus confirming the competitive advantage conferred 
by bac-21. To control for unanticipated effects of the specific antibiotic 
resistance marker on the survival advantage of E. faecalis, the recip- 
rocal experiment using EF, and EF, + pPD1 was performed and 
yielded similar results (Extended Data Fig. 6). EF, + pPD1::AbacAB 
is outcompeted by EF, in all circumstances (Fig. 2b, e, h), confirming 
its loss of colonization fitness. pPD1 also provided a competitive 
advantage during in vitro co-culturing competitions (Extended Data 
Fig. 7). Complementation of the AbacAB strain restored bacteriocin 
activity, stable gut colonization (Extended Data Fig. 8) and the com- 
petitive advantage over other enterococci (Fig. 2c, f, i and Extended 
Data Fig. 8). 

During competition experiments, we noted the persistence of EF, 
when given in combination with EF,+pPD1. This suggested that 
EF, + pPD1 was able either to clear a niche and enhance overall EF, 
survival or to transfer pPD1 to EF, by conjugation. To determine 
whether conjugation of pPD1 occurred in the gastrointestinal tract we 
screened drinking water, gastrointestinal tissue (Supplementary Table 3) 
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and faecal isolates (Fig. 3a, b) of EF, by colony polymerase chain reac- 
tion (PCR). No evidence of conjugation was observed in drinking water. 
The detection of transconjugants in the intestine or faeces indicated that 
conjugation occurred in the gastrointestinal tract. At week 1, approxi- 
mately 25% of the EF, faecal colonies selected had obtained the 
pPD1plasmid via conjugation (Fig. 3a, b). By week 4, nearly 100% of 
the EF, colonies harboured pPD1 (Fig. 3a, b) and produced bacteriocin 
(not shown). Similar results were observed in each segment of the lower 
gastrointestinal tract (Supplementary Table 3). EF, + pPD1::AbacAB 
showed no evidence of the ability to conjugate its defective plasmid (not 
shown). Although complementation of bacA-E ectopically (pSK29) 
restored bac-21 production in EF, + pPD1::AbacAB, plasmid conjuga- 
tion was not restored in vitro or in vivo. The mechanism underlying this 
observation is currently unknown. 
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Figure 3 | pPD1 is transferred via conjugation within the mouse 
gastrointestinal tract. a,b, Mice (n = 5 per group) were colonized with 
mixtures of EF, and EF, + pPD1 in drinking water at the indicated ratios. 
Faecal EF, colonies (100 per animal) were screened weekly for pPD1. c, Mice 
(n = 5 per group) were stably colonized with EF, and then challenged with 
EF, + pPD1. Faecal EF, colonies (50 per animal) were screened weekly for 
pPD1. In a-c, each symbol represents the percentage of faecal transconjugants 
in an individual animal. The fitted logistic curve is overlain. The results in 
a-c are from one experiment each. 


To determine whether EF, + pPD1 could displace an established 
resident enterococcal population, we performed sequential coloniza- 
tion experiments, in which EF, colonization was stably established, 
followed by challenge with EF,+pPD1. The levels of EF, were 
not notably altered after introducing EF, + pPD1 (Extended Data 
Fig. 9); however, transconjugants of EF, containing pPD1 started to 
appear as early as week 0 and neared 100% by week 4 (Fig. 3c). 
Although these results cannot distinguish extensive transfer of 
pPD1 by conjugation from a small number of transfer events fol- 
lowed by proliferation of pPD1-containing transconjugants, we 
conclude that pPD1 enhances E. faecalis competition in the gastro- 
intestinal tract by bac-21-mediated killing or through transfer of a 
functional pPD1 plasmid via conjugation. To probe conjugation 
dynamics between non-isogeneic enterococci, we investigated indi- 
genous enterococcal transconjugants in mice that were colonized 
with EF, + pPD1 (Fig. le and Extended Data Fig. 3b-e). We found 
that conjugation of pPD1 to indigenous enterococci is possible and 
can be observed both in vivo and in vitro, but also that it happens at a 
low and variable frequency (Extended Data Fig. 10). The reasons for 
this remain unknown, but are likely to be multifactorial, especially in 
the gastrointestinal tract. 

Colonization with antibiotic-resistant enterococci precedes emer- 
gence of enterococcal infections. Bac-21-producing E. faecalis exhibits 
bacteriocin activity in vitro against many multidrug-resistant clinical 
isolates of E. faecium and E. faecalis (Supplementary Table 4). To 
determine whether bac-21 can disrupt in vivo colonization by vanco- 
mycin-resistant E. faecalis V583, mice colonized with rifampicin- 
resistant V583 (V583,) were challenged with the complemented 
bac-21 + strain (EF, + pPD1::AbacAB bacA-E +). The conjugation 
defect of this strain eliminated the risk for bacteriocin transfer to 
V583,. EF, + pPD1::AbacAB bacA-E + successfully eliminated V583, 
levels to below the detection limit in most of the mice compared to 
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Figure 4 | Bacteriocin reduces V583, colonization. Mice (n = 5 per group) 
were colonized with V583,. V583, was removed from the drinking water of both 
groups two weeks before sampling (week -2). Group 1 received sterile water, 
whereas group 2 received EF, + pPD1::AbacAB bacA-E+ in their drinking 
water for two additional weeks, followed by sterile water at week 0. Faecal levels 
of V583, (a) and EF, + pPD1::AbacAB bacA-E+ (b) were enumerated weekly. 
The retention of complementation plasmid pAM401A::bacA-E+ by 

EF, + pPD1::AbacAB bacA-E+ was determined weekly (b). Lines are fitted 
using an exponential decay model and the rate of decay is significantly different 
between the two groups in both a (P < 0.0001) and b (P< 0.0001). Each 
symbol represents an individual animal and data are representative of three 
biologically independent experiments. The black dotted lines indicate the limit 
of detection at 100 CFU per g faeces. Spc®, spectinomycin resistant; Spc*Cm*, 
spectinomycin resistant, chloramphenicol resistant. 


the control group, which showed steady levels of V583, in faeces 
(Fig. 4a). As expected, EF, + pPD1::AbacAB bacA-E + did not transfer 
bacteriocin to V583, during gut colonization and the loss of 
pAM401::bacA-E resulted in persistent colonization by the bacterio- 
cin-defective AbacAB strain (Fig. 4b). V583, clones recovered from 
the one mouse with persistent V583, levels in the treated group 
(group 2) were examined and found to be susceptible to bac-21 
in vitro. We suggest that a stochastic, particularly rapid loss of com- 
plementing plasmid (pAM401::bacA-E) from the challenging strain 
(EF + pPD1::AbacAB bacA-E+) in that particular mouse allowed 
V583, to persist over time. These results demonstrate that introducing 
a conjugation-defective bac-21-producing strain into the gastro- 
intestinal tract can successfully reduce colonization by multidrug- 
resistant E. faecalis. 

Our findings provide proof of concept for a novel therapeutic strat- 
egy to specifically decolonize antibiotic-resistant enterococci from the 
gastrointestinal tract (of patients) and thereby prevent the emergence 
of resistant enterococcal infections that are otherwise difficult, or 
impossible, to treat. Although stable colonization by a bac-21- 
producing E. faecalis strain (EF + pPD1::AbacAB bacA-E +) can elim- 
inate multidrug-resistant enterococci from the gut, we expect that the 
bacteriocin-producing strain may itself eventually acquire antibiotic 
resistance determinants, compromising its utility. Thus, therapeutic 
implementation of this strategy will probably require a modified 
approach. Nevertheless, use of a commensal bacterium to deliver 
bac-21 leverages the niche specificity of the therapeutic strain to pro- 
duce bac-21 directly in the appropriate niche(s) at levels sufficient to 
inhibit other E. faecalis, yet not perturb the overall community. We 
expect that the concept presented here—specific inhibition of particu- 
lar members of the gut ecosystem—will be widely applicable to modu- 
late gut colonization by other problematic organisms as well. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Bacterial strains, growth media and chemicals. The strains used in this study are 
listed in Supplementary Table 5. Brain-heart infusion medium (BHI) and 
m-Enterococcus agar (Ent-agar) (Difco) were prepared as described by the man- 
ufacturer (Becton Dickinson). Antibiotics were purchased from Sigma and used at 
the following concentrations: rifampicin, 200 pig ml}; spectinomycin, 500 pg ml}; 
erythromycin, 10 pgml"'; chloramphenicol, 15 pg ml”; tetracycline, 10 pg ml. 
E. faecalis was cultured in BHI media at 37°C. All restriction enzymes were 
purchased from New England BioLabs. Phusion High-Fidelity DNA 
Polymerase (Thermo Scientific) was used for all PCRs performed for strain and 
plasmid construction. Oligonucleotides were synthesized by Fisher Scientific 
(Supplementary Table 6). 

Animals. The committee for animal care and use at the Medical College of 
Wisconsin approved all animal-related procedures and experiments. Five-week- 
old male FVB and C57Bl/6 mice were obtained from Taconic Laboratories 
(Oxnard and Germantown facilities). Upon arrival, mice were allowed to adapt 
to the new environment for at least 1 week before the start of any experiment. 
Animals were housed under specific-pathogen-free conditions in the Medical 
College of Wisconsin vivarium. Experimental sample sizes were determined by 
appropriate husbandry considerations as determined by the Medical College of 
Wisconsin vivarium and experiments were repeated as described. No blinding was 
performed and no scheme of randomization was applied when allocating mice for 
the experiments. 

Generation of bacAB deletion in pPD1. A mutant lacking bacAB was con- 
structed using a marker gene (ermC) insertion in an in-frame deletion. The design 
for in-frame deletion and the procedure for mutant construction were similar to 
those used previously with pCJK218 (ref. 21). The PCR products with the first and 
last codons in the bacAB fragment and flanking the ermC gene were inserted 
between the XbaI and Sphl sites of pCJK218 by Gibson Assembly” to generate 
pSK21. Deletion mutants were isolated by plating on counter-selection medium 
containing 4-chloro-pL-phenylalanine followed by prolonged incubation at 30 °C 
(ref. 21). The desired mutants harbouring the bacAB deletion in pPD1 plasmids 
(pSK35) were identified by PCR analysis and erythromycin selection. Two inde- 
pendent isolates were obtained and were analysed to exhibit identical phenotypes. 
Complementation studies. The bacABCDE fragment was amplified using pri- 
mers listed in Supplementary Table 6 and was cloned into a pAM401 vector using 
the Gibson assembly” approach, resulting in pSK29. Strains with a bacAB deletion 
in pPD1 (CK135 pSK35 or OGI1sp pSK35) were electroporated with pSK29. PCR 
analysis, chloramphenicol resistance and bacteriocin assay confirmed the com- 
plementation construct. 

Bacteriocin assays. As previously described’’, 50 pl of an overnight culture of 
the indicator strain (CK135 or OG1sp) grown in BHI broth was added to 5 ml of 
molten BHI soft agar and then spread evenly onto a BHI agar plate. After solidi- 
fication, 2 ll of the test strain that was grown overnight was spotted onto the soft 
agar. Zones of inhibition of the susceptible strain around the spots were monitored 
after overnight incubation at 37 °C. 

Mouse colonization model. Overnight cultures of E. faecalis were washed with 
sterile water and added to autoclaved water to a final concentration of 
5X 10°CFU ml". Persistence of E. faecalis in drinking water was determined 
daily and remained between 10’ and 10° CFU ml” over 3 days. Drinking water 
was changed every 3-4 days to maintain the appropriate inoculum and mice were 
allowed to drink ad libitum. After two weeks, the inoculated drinking water was 
replaced with sterile water for the duration of the experiment. For the sequential 
colonization experiment, mice were first colonized with a strain lacking pPD1 
(initial strain) by feeding the bacteria through drinking water for two weeks. Three 
days after the initial strain was withdrawn from drinking water, the challenge 
strain was introduced in the drinking water for two weeks, after which animals 
were returned to regular sterile water (week 0). 

Bacterial culture and quantification of E. faecalis from mouse faeces and 
intestines. Faecal pellets obtained directly from living mice were weighed and 
homogenized in 1 ml PBS. Faecal homogenates were plated in serial dilution on 
BHI agar plates with appropriate antibiotics to enumerate levels of E. faecalis 
colonization. After mice were euthanized, the intestinal tract was removed and 
divided into three segments: distal 10 cm of the small intestine, caecum and large 
intestine. Each segment was homogenized in 2 ml PBS and cultured on BHI media 
with appropriate antibiotics. To enumerate indigenous enterococci or total enter- 
ococci, faecal and tissue homogenates were plated in serial dilution on Ent-agar 
(Difco). Alternative to BHI, laboratory strains were enumerated on Ent-agar 
supplemented with appropriate antibiotics. 

Colony PCR for bacA and bacD genes. To determine transconjugants, colonies 
were selected and used as PCR templates for identification of the bacA or bacD 
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gene using gene-specific primers as described in Supplementary Table 6. PCR 
products were identified using agarose gel electrophoresis. 

Plasmid DNA preparation and sequencing of pPD1. The pPD1 plasmid was 
purified as described previously”’. Sequencing was performed at GENEWIZ, Inc. 
(South Plainfield, New Jersey) on the Illumina HiSeq Platform in a 2 * 150-bp 
paired-end configuration. The de novo assembly using the CLC Genomics 
Server 6.5.1 was used to obtain assembled contigs/scaffolds. In total, 59 ORFs 
larger than 100bp showing sequence similarity to known sequences in the 
NCBI database were identified by the NCBI ORF finder (http://www.ncbi.nlm. 
nih.gov/projects/gorf/). 

Bacterial genomic DNA extraction. Caeca were isolated from experimental ani- 
mals and homogenized as described”*. Genomic DNA was extracted from tissues 
using the MO BIO PowerFecal DNA Isolation kit (MO BIO, Carlsbad, California) 
with slight modification in the protocol. After addition of solution C1 and heating 
the samples at 65 °C for 10 min, the sample was subjected to further heating at 
95 °C for 10 min followed by vigorous bead beating using PowerLyzer (MO BIO). 
Amplification of bacterial 16S gene sequences and high-throughput sequen- 
cing. The 16S rDNA V4 region amplicons (single index) were produced by PCR 
and sequenced on the MiSeq platform (Illumina) using the 2 X 250-bp protocol 
yielding paired-end reads that overlap by ~247bp (ref. 25) (performed by 
Metanome, Baylor College of Medicine, Waco, Texas). Following sequencing, 
raw BCL (binary base call) files were retrieved from the MiSeq platform and called 
into fastqs by Casava v1.8.3 (Illumina). The read pairs were demultiplexed based 
on unique molecular barcodes, filtered for PhiX using Bowtie2 v2.2.1 (ref. 26) and 
reconstituted into two fastq files for each read using standard BASH. Sequencing 
reads were merged (allowing 4 mismatches per 50 bases or more) and processed 
using USEARCH v7.0.1001 (ref. 27). Sequences were demultiplexed using QUME 
v1.8.0 (ref. 28) and then clustered using the UPARSE pipeline’. Operational 
taxonomic unit (OTU) classification was achieved by mapping the UPARSE 
OTU table to the SILVA database”. Abundances were recovered by mapping 
the demultiplexed reads to the UPARSE OTUs. A custom script constructed an 
OTU table from the output files generated in the two previous steps. The OTU 
table was used to calculate alpha and beta diversities and provide taxonomic 
summaries’, 

Bioinformatic analysis. Microbiome data were analysed using the Vegan*’ and 
Ecodist™ packages in R 3.0.2 (ref. 35). Sequence counts of each sample were 
normalized to the average sequencing depth, and the Bray-Curtis metric was used 
to assess intersample (beta) diversity. Statistical significance for differences in 
microbiome diversity between groups was determined using Adonis (in Vegan). 
NMDS ordination (in Ecodist) was used to visualize group clustering and diversity 
distance between samples. Log-transformed abundance data (logy, of sequence 
count + 1) was used to normalize the data and statistically significant differences 
in OTU abundance between groups were determined by heteroscedastic, two- 
sided Student’s t-tests. 

Statistical analysis of data. To compare the E. faecalis in faeces/tissues under 
different experimental conditions, we applied either a non-parametric Mann- 
Whitney-Wilcoxon or Kruskal-Wallis test owing to the skewness of the data. 
To model the response changes over time, we fitted an exponential decay in the 
following form: 


EF in faeces(log CFU per g) =kye ** 


where ky is the E. faecalis in faeces (log CFU per g) at time 0, k, is the rate of decay 
and f is time. 

Wealso fitted a logistic curve to model the percentage transconjugants using the 
following equation: 


ky 


; 0, — 
Percentage transconjugants (%) = = s 


where k, is the maximum value of the curve, kp is the steepness of the curve, kg is 
the time of the midpoint of the sigmoid and t is time. 

For the non-linear mixed-effects models, we used a compound symmetry cor- 
relation structure to model the dependency among observations. If compound 
symmetry was not a good fit, a general correlation matrix or AR(1) (autocorrela- 
tion structure of order 1) correlation matrix was used. For responses that had non- 
zero values only at one or two time points, a model could not be fit. In these cases, 
we compared the responses at each time point. A linear mixed-effect model with 
autoregressive covariance structure was used to compare the E. faecalis in faeces 
over time for E. faecalis and E. faecalis + pPD1. Mice were treated as random. The 
response was log transformed and groups were compared using a two-tailed t-test 
at an alpha of 0.05. With 5 mice in each group, we had at least 80% power to detect 
a difference of at least 2.4 standard deviations. PASS 2008 was used for the cal- 
culation (http://www.ncss.com). SAS (SAS Institute, Cary, North Carolina), 


©2015 Macmillan Publishers Limited. All rights reserved 


LETTER 


S-PLUS (Insightful Corporation, Seattle, Washington), R (ref. 35) and SPSS (IBM 
Corporation, Armonk, New York) were used for statistical analyses. All tests were 
two-tailed. A P-value <0.05 was considered statistically significant. 
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strains were enumerated weekly as described above. In both a and b, horizontal 
lines indicate geometric means and each symbol represents an individual 

animal. Data are representative of more than three experiments in a; in b, data 
are representative of more than three experiments for V583, and OGIRF and 


Extended Data Figure 1 | EF colonization. a, C57Bl/6 mice (n = 5) were 
given EF, in drinking water for 14 days. Faecal samples were taken from each 
animal at the transition to sterile drinking water (week 0) and then weekly. 
EF, abundance was determined by enumeration on brain-heart infusion (BHI) 
agar with rifampicin. b, V583, OG1RF and JL214 strains of E. faecalis that the result of one experiment for JL214. 


are rifampicin resistant were fed to groups of mice (n = 5 per group). E. faecalis 
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Extended Data Figure 2 | Map of plasmid pPD1. The 59 ORFs identified in magenta. A circular plasmid map was generated using the SnapGene software 
the nucleotide sequence of pPD1 are located on a circular map. Arrows (GSL Biotech; available at http://www.snapgene.com). Schematic diagrams of 
indicating the direction of transcription show the ORFs. Different colours multiple alignments of plasmids were produced by manually realigning the 
indicate coding regions for conjugation (black), bac operon (red) and linear plasmid maps generated by the SnapGene Viewer. 


maintenance or repair (blue). Hypothetical coding regions are shown in 
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Extended Data Figure 3 | pPD1 enhances E. faecalis competition for an enumeration on m-Enterococcus selective agar (Ent agar). Laboratory strains 
intestinal niche. a, Bacteriocin assay by the soft agar method, with of E. faecalis were enumerated using Ent agar with rifampicin (Rf) or BHI agar 
with rifampicin. d, e, At the end of week 4, animals were euthanized and 


EF, + pPD1::AbacAB bacA-E+, EF, + pPD1 and EF, + pPD1::AbacAB 

spotted on a lawn of susceptible E. faecalis. Mice (n = 5 per group) were given abundance of E. faecalis was determined in the distal small intestine (d) and 
EF,, EF, + pPD1, EF, + pPD1::AbacAB or sterile drinking water for 14 days, large intestine (e) (also see Fig. le). The results shown are representative of two 
after which all mice were given sterile water. b, c, One week (b) and four weeks _ biologically independent experiments. In b-e, horizontal lines indicate 

(c) after withdrawal of E. faecalis from the drinking water, faecal samples were geometric means and each symbol represents an individual animal. 

collected and the abundance of total enterococci was determined by 
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Extended Data Figure 4| EF + pPD1 but not EF, + pCF10 dominates the 
intestinal enterococcal population. pCF10 is a well-studied pheromone- 
inducible conjugative plasmid of E. faecalis that encodes resistance to 
tetracycline but does not encode a known bacteriocin determinant. a, EF, 

(n = 3 mice, no plasmid), EF, + pPD1 (n = 3 mice, pPD1) or EF, + pCF10, 
(n = 5 mice, pCF10) was added to the drinking water for 14 days and then 
replaced by sterile drinking water. Faecal samples were taken from each animal 
at the transition to sterile drinking water (week 0) and then weekly. E. faecalis 
abundance was determined by enumeration on BHI agar with rifampicin. An 
exponential decay model is used to fit the data and there are significant 
differences between the groups pCF10 (red) and pPD1 (blue) (P = 0.012), as 
well as between the pPD1-containing and plasmid-free groups (black) 

(P = 0.007). b, Four weeks after withdrawal of E. faecalis from the drinking 
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water, animals were euthanized and abundance of EF, + pCF10 was 
determined in the distal small intestine (DSI), caecum and large intestine (LI). 
Mice colonized with EF, + pCF10 maintained long-term faecal shedding of 
EF, + pCF10 similar to EF, and persistent colonization throughout the 
gastrointestinal tract. c, d, Abundance of enterococci in the faeces was 
determined by enumeration on m-Enterococcus agar (Ent agar), Ent agar 
with rifampicin or BHI agar with rifampicin at week 1 (c) and week 4 (d). 
Unlike EF, + pPD1, which dominated the enterococcal niche in the 
gastrointestinal tract (Fig. le), EF, + pCF10 did not outcompete the indigenous 
enterococci, colonizing at levels comparable to EF,. In b-d, horizontal lines 
indicate geometric means and each symbol represents an individual animal; 
data were obtained from one experiment. 
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Extended Data Figure 5 | Microbiome analysis. a, NMDS ordination of 
control (no EF treatment) and E. faecalis samples as separated by the Bray— 
Curtis beta diversity metric. Control (n = 5 mice) and E. faecalis samples (n = 5 
mice) are intermixed. No significant difference in beta diversity was seen 
between the two groups. Adonis P-value = 0.298. Samples are connected with 
lines to help visualize grouping. b, Analysis between E. faecalis and EF + pPD1 
samples at the level of operational taxonomic unit suggests changes in the 
abundance of four bacterial genera, in particular for Deferribacteraceae; 
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Mucispirillum and Lachnospiraceae; Incertae sedis (P = 0.0001 and P = 0.005, 
respectively; heteroscedastic two-sided Student’s t-test). Bacteria belonging 

to these two genera were tenfold and twofold lower, respectively, in 

EF + pPD1-colonized mice. (Note that the magnitude of change is shown in 
logyo scale.) Changes in Defluviitaleaceae; Incertae sedis and Lachnospiraceae; 
Blautia were not as pronounced (P = 0.03; heteroscedastic two-sided 
Student’s t-test). However, analyses at the family taxonomy level suggest 

that the change in Deferribacteraceae was statistically significant. 
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Extended Data Figure 6 | Reciprocal experiment for data from Fig. 2a,dand__ indicated by open squares; abundance of EF, is indicated by open circles. Each 
g. Groups of mice (m = 5 mice per group) were given mixtures of EF, + pPD1 symbol represents an individual animal. The differences between the two 
and EF, in drinking water at ratios of 10%/90% (a), 50%/50% (b) and groups at each week were compared using a non-parametric Wilcoxon test. In 
90%/10% (c), respectively. Faecal samples were obtained at the transition to a, no P-values were significant, in b, P< 0.005 and in c, P = 0.0122 (week 0) 
sterile drinking water (week 0) and then weekly. Abundance of each E. faecalis and P = 0.0075 (week 1-4). An exponential decay model is used for fitting the 
strain in the faeces was determined by enumeration on BHI agar with data in a and b. The results in a-c are from one experiment. 

rifampicin and BHI agar with spectinomycin. Abundance of EF, + pPD1 is 
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Extended Data Figure 7 | pPD1-associated competition in vitro. Three 
independent cultures were carried out with various mixed populations of EF, 
(-) and EF, + pPD1 (+) or EF, (-) and EF, + pPD1::AbacAB (+AbacAB) in 
10 ml BHI broth at ratios of 90%/10% (a, d), 50%/50% (b, e) and 10%/90% 
(c, f). Samples for serial dilution were taken at 0, 2, 4, 6 and 24 h after the start 
of the experiment. Abundance of each E. faecalis strain in faeces was 


determined by enumeration on BHI agar with rifampicin and BHI agar with 
spectinomycin. Evidence of conjugation was observed in in vitro co-cultures 
of a and b only by screening for transconjugants (EF, + pPD1) via colony 
PCR. Open squares represent abundance of EF, + pPD1 (a-c) or 

EF, + pPD1::AbacAB (d-f). Open circles represent abundance of EF, in all 
panels. Data are representative of two biologically independent experiments. 
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Extended Data Figure 8 | Complementation of bac-21 production 
restores colonization phenotype by providing a competitive advantage. 
Bacteriocin activity was restored upon ectopic expression of bacA to bacE 
(from pAM401) in EF + pPD1::AbacAB but not in E. faecalis lacking pPD1, 
indicating that the distal part of the bac operon (bacF to bacl) is necessary 
for bacteriocin expression and that the bacAB in-frame deletion is not polar 
on downstream genes. Mice (n = 5) were given EF, + pPD1::AbacAB 
bacABCDE-+ as described in the methods and abundance was determined by 
enumeration on m-Enterococcus (Ent) agar, m-Enterococcus agar plus 
rifampicin (Ent-Rif) or BHI agar with rifampicin (BHI-Rif). The presence of 
pAM401A::bacABCDE + (complementing plasmid) was determined by 
enumerating CFU on BHI agar with rifampicin and chloramphenicol 
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(BHI-Rif Cm). Faecal samples were obtained at week 1 (a) and week 4 (b) after 
transition to sterile drinking water. Horizontal lines indicate geometric 
means. Each symbol represents an individual animal and data are represen- 
tative of two biologically independent experiments. EF, + pPD1::AbacAB 
bacABCDE + stably colonized the gastrointestinal tract (a); however, in the 
absence of chloramphenicol selection, pAM401::bacABCDE was gradually lost 
from the population (b). Over time, loss of pAM401::bacABCDE resulted in 
the complemented strain reverting to the bacteriocin-defective AbacAB strain, 
with the loss of bacteriocin activity. Nevertheless, this strain persisted in the 
gut, suggesting that bac-21 was essential for clearing a niche for E. faecalis; once 
cleared, E. faecalis uses other mechanisms to maintain colonization. 
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Extended Data Figure 9 | EF, levels are not altered by sequential 
colonization with EF, + pPD1. Groups of mice (n = 5 mice per group) were 
given EF, in drinking water for two weeks and subsequently challenged with 
EF, + pPD1 in drinking water for another two weeks (starting from week -2) 
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before transition to sterile water (week 0). Faecal samples were obtained weekly 
to enumerate the abundance of EF, (a) and EF, + pPD1 (b) on BHI agar with 
rifampicin and spectinomycin, respectively. Each symbol represents an 
individual animal and data are from one experiment. 
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Extended Data Figure 10 | Conjugation frequency of pPD1 between the 
laboratory strain (EF + pPD1) and indigenous enterococci in vitro. To 
understand conjugation dynamics between non-isogeneic species of 
enterococci, we investigated indigenous enterococcal transconjugants in mice 
that were colonized with EF, + pPD1 (Fig. le and Extended Data Fig. 3b-e). 
However, we were unable to detect rifampicin-sensitive enterococci from the 
faecal sample at week 4 (Extended Data Fig. 3c). At week 1, only nine clones 
of rifampicin-sensitive enterococci (out of 730 enterococci) were isolated 
from three mice in a group of five (Extended Data Fig. 3b). Bacteriocin assays 
and probing for the bacA gene sequence confirmed that six of the nine clones 
were transconjugants and were bac-21 positive. 16S rDNA gene sequencing 
of these nine clones showed notable similarities to E. faecalis 16S rDNA. To 
understand the reason for the low conjugation frequency between indigenous 
enterococci and the laboratory strain of EF, + pPD1, in vitro conjugation 
experiments were performed to assess the frequency of plasmid transfer. Ten 
new clones of indigenous enterococci were isolated from the faeces of five 
mice (two clones per mouse that were not colonized with any laboratory strain) 
by culturing on m-Enterococcus agar. EF, + pPD1 was mixed with each of 
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the ten indigenous enterococci clones in BHI broth at a ratio of 1:9. Samples for 
serial dilution were taken at the start of the culture (a) and 24h after the start of 
the experiment (b). Abundance of total enterococci was determined using 
m-Enterococcus agar and BHI agar with rifampicin for EF, + pPD1. Data are 
representative of three biologically independent experiments. In vitro 
bacteriocin assays revealed that EF, + pPD1 is capable of killing most of the 
non-pPD1-containing indigenous enterococci strains (not shown). In vitro 
conjugation assays between individual indigenous enterococci clones and 

EF, + pPD1 led to three observations: first, 4 out of 10 clones were susceptible 
to bac-21 and were eliminated by EF, + pPD1 (no. 3, 4, 8 and 9); second, 6 
out of 10 were immune to bac-21 killing (no. 1, 2, 5, 6, 7 and 10); third, probing 
for bacA provided evidence for pPD1-containing indigenous enterococci 
transconjugants in 4 out of the 6 immune indigenous enterococci clones. The 
two clones that failed to conjugate and were resistant to bac-21 killing in the 
mixed culture experiment were also resistant to EF, + pPD1 on bacteriocin 
assay plates. The mechanism for resistance of these two clones is not clear; 
however, they might have harboured cross-resistance traits. Error bars 
represent s.e.m. 


©2015 Macmillan Publishers Limited. All rights reserved 


Mae A tea 


doi:10.1038/nature15375 


Crystal structure of the 500-kDa yeast acetyl-CoA 
carboxylase holoenzyme dimer 


Jia Wei’ & Liang Tong! 


Acetyl-CoA carboxylase (ACC) has crucial roles in fatty acid meta- 
bolism and is an attractive target for drug discovery against 
diabetes, cancer and other diseases’ °. Saccharomyces cerevisiae 
ACC (ScACC) is crucial for the production of very-long-chain fatty 
acids and the maintenance of the nuclear envelope”*. ACC contains 
biotin carboxylase (BC) and carboxyltransferase (CT) activities, 
and its biotin is linked covalently to the biotin carboxyl carrier 
protein (BCCP). Most eukaryotic ACCs are 250-kilodalton 
(kDa), multi-domain enzymes and function as homodimers and 
higher oligomers. They contain a unique, 80-kDa central region 
that shares no homology with other proteins. Although the struc- 
tures of the BC, CT and BCCP domains and other biotin-depend- 
ent carboxylase holoenzymes are known’ “*, there is currently no 
structural information on the ACC holoenzyme. Here we report 
the crystal structure of the full-length, 500-kDa holoenzyme dimer 
of ScACC. The structure is remarkably different from that of the 
other biotin-dependent carboxylases. The central region contains 
five domains and is important for positioning the BC and CT 
domains for catalysis. The structure unexpectedly reveals a dimer 
of the BC domain and extensive conformational differences com- 
pared to the structure of the BC domain alone, which is a mono- 
mer. These structural changes reveal why the BC domain alone is 
catalytically inactive and define the molecular mechanism for the 
inhibition of eukaryotic ACC by the natural product soraphen 
A’*?° and by phosphorylation of a Ser residue just before the BC 
domain core in mammalian ACC. The BC and CT active sites are 
separated by 80 A, and the entire BCCP domain must translocate 
during catalysis. 

The primary sequences of the single-chain, multi-domain eukaryo- 
tic ACCs can be divided into three regions of roughly equal sizes. The 
amino-terminal region (residues 1-795 in SCcACC, Fig. 1a) contains the 
BC and BCCP domains, with possibly a BT (BC-CT interaction) 
domain between them, as observed in the structures of propionyl- 
CoA carboxylase (PCC)" and 3-methylcrotonyl-CoA carboxylase 
(MCC)”. The carboxy-terminal region (residues 1492-2233) contains 
the N and C domains of CT””. The structure and function of the central 
region (residues 796-1491) are currently not known. It is not as well 
conserved among the eukaryotic ACCs (Extended Data Figs 1-3). BC 
catalyses the MgATP-dependent carboxylation of the N1’ atom of 
biotin (Extended Data Fig. 4). The carboxybiotin (and BCCP) then 
translocates to the CT active site, where the substrate acetyl-CoA is 
carboxylated. 

We expressed in Escherichia coli the 250-kDa ScACC (residues 
22-2233) and determined its crystal structure at 3.2A resolution 
(Extended Data Table 1, Extended Data Fig. 4). The structure of the 
ScACC holoenzyme is remarkably different from that of the other 
biotin-dependent carboxylases””'*. The 500-kDa holoenzyme dimer 
obeys two-fold symmetry, and its overall structure is shaped like a 
quarter of a disk (Fig. 1b), with a radius of ~140A and thickness 
of ~120A (Fig. 1c), although there is a large channel measuring 
~30 A across through the centre of the holoenzyme (Extended Data 
Fig. 5). A BC domain dimer (Fig. 1d) is located near the centre of the 


disk, while the CT domain dimer (Fig. le) forms a part of the edge of 
the disk. A BCCP domain is positioned near the centre of each face of 
the holoenzyme, and its biotin is located in the CT active site (Fig. 1b). 

We have also determined the structure at 3.1 A resolution of the 
holoenzyme where the BCCP domain is not biotinylated (Extended 
Data Table 1). The overall structure of this dimer is essentially the same 
as the other structure, with r.m.s. distance of 0.45 A for their 3,928 
equivalent Ca atoms, although the BCCP domain is disordered in the 
absence of biotinylation. Earlier studies showed that biotinylation 
stabilizes E. coli BCCP’*”’. This structure of the holoenzyme will not 
be discussed further here. 

The overall structures of the two protomers of the holoenzyme are 
similar, with r.m.s. distance of 1.1 A for 1,862 equivalent Cx atoms 
located within 3 A of each other after superposition. On the other 
hand, the r.m.s. distance is only 0.76 A if the CT domains are super- 
posed, and differences in the orientation and position of the other 
domains are visible, especially for the BC domain, which is located 
furthest from CT (Extended Data Fig. 4). Within CT, conformational 
differences in the small inserted domain’’ are observed (Extended 
Data Fig. 5), likely to be linked to differences in the position of 
BCCP in the two protomers, as the insert domain has direct contacts 
with BCCP. 

The structure reveals that the central region of SCACC contains five 
domains, which we have named ACC central (AC) domains AC1 
through AC5, giving a total of 10 major domains for each ScACC 
protomer (Fig. la). Domains AC1, AC2 and AC3 are all helical 
(Fig. 1f, Extended Data Fig. 2). AC1 contains three pairs of anti-parallel 
helices as well as inserts of a four-helical bundle (domain AC2, helices 
03-06) and a helical hairpin (18-09, Extended Data Fig. 5). AC3 is 
also a four-helical bundle but it has no interactions with AC1 and AC2, 
and instead is positioned between AC4 and AC5, mediating interac- 
tions between them. Domain AC4 is located at the end of the disk edge, 
and is separated from the AC4 domain of the other protomer by 
~200A (Fig. 1b). Unexpectedly, the structure shows that AC4 and 
AC5 have similar backbone folds, consisting of a twisted B-sheet 
flanked on one face by helices (Extended Data Figs 4, 6). The r.m.s. 
distance is 2.9 A for their equivalent Co: atoms, but the sequence iden- 
tity is only 12%. This backbone fold has weak similarity to a part of 
formamidase” (Extended Data Fig. 6) and several other enzymes, but 
the Z score is below 6.5 and the sequence identity is less than 9%". The 
active sites of these enzymes are not conserved in AC4 or ACS. 
Therefore, this structural similarity is unlikely to have any functional 
significance for ACC. 

The structure confirms the presence of a BT domain in the 
N-terminal region of ScACC (Fig. 1a). Its structure is similar to that 
in PCC", with a central, long helix (Extended Data Fig. 4) surrounded 
by an eight-stranded anti-parallel B-barrel (Extended Data Fig. 5). The 
domain helps to mediate interactions between the BC and CT 
domains, as there are no direct contacts between them in the holoen- 
zyme (Fig. 1b). Within each protomer, the last three strands of the BT 
domain f-barrel (827-829) faces the BC domain (Figs 1b, 2a), while 
the C-terminal end of the long helix and the following loop connecting 
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Figure 1 | Crystal structure of the 500-kDa yeast acetyl-CoA carboxylase 
(ScACC) holoenzyme dimer. a, Domain organization of ScACC. The three 
regions of the sequence are also indicated. AC, ACC central. b, Overall 
structure of ScACC holoenzyme dimer. One protomer is shown as ribbons 
while the other is shown only asa surface for clarity, both coloured according to 
a. The two-fold axis of the dimer is vertical (black line). c—e, Overall structure of 


to the first strand (the hook"’) is flanked by the helical hairpin insert of 
AC1 (a8-a9) on one side and an inserted segment of a long loop 
between two B-strands (B4A and B4B) in the C domain of CT on 
the other side (Fig. 2b, Extended Data Fig. 4). In addition, a part of 
the long linker between the BCCP and AC1 domains has hydrophobic 
interactions with the top of one side of the BT domain f-barrel (Fig. 1b, 
Extended Data Fig. 5). 
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ScACC holoenzyme, viewed from the side (c), down the BC domain dimer 
(d), and down the CT domain dimer (e). The two-fold axis is indicated with the 
black oval. f, Structure of the five domains (AC1-ACS5) in the central region 
of ScACC. The arrow points to the helical hairpin insert of AC1. The structure 
figures were produced with PyMOL (http://www.pymol.org). 


A total of ~9,000 A? of the surface area of each protomer is buried in 
the dimer interface, predominantly from the BC (1,200 A”, Extended 
Data Fig. 7) and CT (5,800 A’, Extended Data Fig. 8) domain dimers. 
The BC domain contributes an additional 900 A” through contacts 
with the BT domain (500A’, Fig. 2a), AC1 (80A*) and AC2 
(230 A’, Fig. 2c) of the other protomer. The BCCP domain buries 
~400 A? in the CT active site, where it contacts the C domain of the 
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Figure 2 | Interactions among the domains in the ScCACC holoenzyme. 

a, The BT domain contacts the BC domain dimer. Side chains of residues in the 
interfaces between the BT domain (orange) and the BC domain of the same 
protomer (red, Mol 1) and the BC domain of the other protomer (salmon, 
Mol 2) are shown as stick models. b, Interactions between the hook of the BT 
domain (orange) and the helical hairpin insert of AC1 domain («8 and «9, 
green) and the B4A-[4B loop from the C domain of CT (yellow). c, Interactions 
between domains AC1 (green) and AC2 (light green) of one protomer with 
the BC domain (salmon) of the other protomer. 


other protomer, but it is expected to translocate to the BC active site 
during catalysis (see later). 

A major surprise from the structure is the observation of a BC 
domain dimer (Fig. 1d), because the BC domain alone is consistently 
a monomer and catalytically inactive based on earlier studies*"°”?”’. 
Moreover, the organization of this BC domain dimer is similar to that 
of the BC subunit dimer of E. coli ACC***® (Fig. 3a), with a mostly 
hydrophilic interface (Extended Data Fig. 7). However, the structure of 
BC domain alone is incompatible with such a dimer due to steric 
clashes between the two molecules’® (Extended Data Fig. 7). Large 
conformational changes for residues in the dimer interface are there- 
fore necessary for the formation of this dimer, primarily involving the 
B-strands and connecting loops in the C sub-domain of BC (Fig. 3b). 
Especially, strand B18 moves by ~8 A, taking it out of the central 
B-sheet of the C-domain (Fig. 3c, Supplementary Video 1). B18 instead 
forms a B-sheet with a new strand, B21, which is not present in the 
structure of BC domain alone. Residues in the 817-818 loop move by 
up to 20A, and those in the B18-B19 loop by up to 7 A (Fig. 3c). In 
addition, the main chain of neighbouring strands B17, B19 and B20 
shifts by ~3 A. The new B18-loop-B19 structure is in the centre of the 
BC domain dimer (Fig. 3a), where the tip of this loop contacts the side 
chain of Trp487 in the other protomer (Extended Data Fig. 7). 

The two distinct conformations of this domain explain why it is cat- 
alytically inactive on its own’* and also define the molecular mechanism 
for the inhibition of eukaryotic ACC by the natural product soraphen A’® 
and by phosphorylation of a Ser residue just before the BC domain core 
in mammalian ACC (Ser80 in human ACC1 and Ser222 in ACC2)” 
(Extended Data Fig. 1). As a part of the conformational change, residues 
in the B19-B20 loop move by up to 10 A (Fig. 3d, Supplementary Video 
2). This loop in the structure of BC domain alone is likely to interfere with 
the binding of BCCP-biotin (Fig. 3d, Extended Data Fig. 7), based on the 
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Figure 3 | A dimeric BC domain in the ScACC holoenzyme. a, Overlay of the 
BC domain dimer of ScACC (in red and salmon) with the BC subunit dimer 
of E. coli ACC (grey) in complex with ADP (green), bicarbonate (black), 
biotin (black), and Mg”* (pink sphere)”®. The two molecules at the top are 
superposed, and the two molecules at the bottom have ~12° difference in 
orientation. b, Overlay of the BC domain of ScACC holoenzyme (in red) with 
the BC domain alone (in grey) in complex with soraphen A (green, labelled 
Sor)'’. The region of large conformational differences is highlighted in light 
blue. The view is the same as in a. c, Detailed view of the conformational 
changes in the dimer interface. Blue arrows indicate some of the changes from 
the BC domain alone (grey) to the BC domain in the holoenzyme (red). 

d, Conformational changes near the biotin binding site, especially the B19-B20 
loop. This is coupled to changes in the B2 to «F segment. A possible 
hydrogen bond between the amide linkage of biotin and the 819-820 loop is 
indicated with the dashed lines (red). e, A model for how conformational 
transitions in the dimer interface affect catalysis and dimerization of the 
eukaryotic BC domain, updated from an earlier model’>. Biotin is shown as 
the fused black pentagons, while soraphen A and phosphorylated serine are 
indicated by Sor and P, respectively. Bacterial BC subunit does not 

undergo the conformational transition, and its monomers can be 
catalytically active”. 


binding mode of biotin to E. coli BC”’. Consequently, BC domain alone is 
catalytically inactive because it assumes a conformation that cannot bind 
the BCCP-biotin substrate. 

Soraphen A recognizes the conformation of isolated BC domain’®, 
and this binding site does not exist in the BC domain dimer in the 
holoenzyme owing to the structural changes (Fig. 3c). For example, 
strands $19 and B20 move into the binding site, and especially the side 
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Figure 4 | Translocation of BCCP and biotin during ACC catalysis. The BC 
and CT active sites (black asterisks) of ScACC are separated by ~80 A (arrow). 
A rotation of ~180° around the BT-BCCP linker (curved arrow in blue) 
could place the biotin into the BC active site. The prime in the labels indicates 
the second protomer. The binding modes of ADP (green) and biotin (black) 
to E. coli BC subunit are shown as stick models”®. 


chain of Trp487 ($19) is in direct clash with soraphen A (Fig. 3c, 
Extended Data Fig. 7). Therefore, soraphen A inhibits the enzyme 
allosterically by stabilizing a catalytically inactive conformation of 
the BC domain. In the context of the holoenzyme, soraphen A binding 
will disrupt the formation of BC domain dimer (Fig. 1d), as the other 
protomer clashes with the compound as well (Extended Data Fig. 7). 
This could also be detrimental for catalysis as the BC domains may not 
be positioned correctly to accept the BCCP-biotin for carboxylation. 

Upon phosphorylation, the peptide segment containing pSer222 of 
human ACC72 is located in the same binding site as soraphen A”’, and 
the Trp487 side chain in the holoenzyme structure also clashes with 
this segment (Extended Data Fig. 7). Therefore, phosphorylation of 
this Ser residue inhibits the enzyme through the same mechanism as 
that of soraphen A. These structural observations greatly extend a 
model for the inhibitory mechanism proposed earlier* (Fig. 3e). 
ScACC does not have an equivalent phosphorylation site in this region 
of the sequence. 

In comparison, the structure of the CT domain dimer in the holoen- 
zyme is essentially the same as that of the domain alone”, with r.m.s. 
distance of 0.61 A among their equivalent Co atoms (Extended Data 
Fig. 8). The binding of both BCCP-biotin and CoA (Extended Data 
Fig. 4) in one of the CT active sites provides direct insights into the 
catalysis by this enzyme. The thiol group of CoA is 4.3 A away from the 
Nl’ atom of biotin (Extended Data Fig. 8). Therefore, the two 
substrates are likely to be in the correct positions for catalysis. The 
position of biotin clashes with that of the compound CP-640186 
(Extended Data Fig. 8), a nanomolar inhibitor of mammalian 
ACCs”, confirming that it functions by blocking biotin binding to 
the CT active site”*. 
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Prior to the structure determination of the holoenzyme, we obtained 
the crystal structures for residues 797-1033 (domains AC1-AC2), 
1036-1503 (AC3-AC5), and 569-1494 (BT, BCCP and the entire 
central region) (Extended Data Table 1). Comparisons of the struc- 
tures of these domains alone with that of the holoenzyme reveal sub- 
stantial variability in the relative positioning of the domains. Domains 
AC3-AC4 in the structure of AC3-AC5 alone can be readily super- 
posed with those in the holoenzyme, but then the orientation of 
domain AC5 differ by 40° (Extended Data Fig. 9). Even more variabil- 
ity is observed in the structure of BT-BCCP-AC1-ACS5 alone, illu- 
strated by the differences in the positioning of AC1-AC72 relative to the 
BT domain and AC3-AC5 (Extended Data Fig. 9). Interestingly, 
between the two unique copies of the BT-BCCP—AC1-AC5 molecule 
in the crystal, one has a conformation of AC3-ACS5 that is very similar 
to that in the holoenzyme (Extended Data Fig. 9), while the other is 
similar to that in AC3—ACS5 alone, suggesting that domains AC3-AC5 
may assume two (or more) distinct conformations. 

Another discovery from the structure is that the central region has 
minimal contributions to the formation of the dimer. Besides the 
~300 A? surface area burial for AC] and AC2 (Fig. 2c), the central region 
has no contacts with the other protomer (Fig. 1b). On the other hand, the 
structure suggests that the central region is important for maintaining the 
BC and CT dimers in the correct relative positions for catalysis. The CT 
domain dimer is sandwiched by ACS on both sides (Fig. le). The BT and 
AC2 domains of the two protomers form a platform, keeping the BC 
domain dimer in place and possibly also helping with BC domain dimer- 
ization (Fig. 1b). The conformational variability observed for the 
domains in the central region (Extended Data Fig. 9) might have a role 
in regulating the activity of the holoenzyme. 

The structural analysis indicates that BCCP-biotin becomes 
carboxylated in the BC active site of its own protomer, and then 
translocates to the CT active site at the dimer interface, where it con- 
tacts the C domain of the other protomer (Fig. 4). The distance 
between the BC and CT active sites is ~80 A, indicating that the entire 
BCCP domain must translocate during catalysis (swinging-domain 
model, Extended Data Fig. 4), as has been observed in the other 
holoenzymes’. In fact, the linker from BT to BCCP (residues 
697-700) is about 45 A from the N1’ atom of biotin, and therefore a 
rotation of this linker by ~180° could bring the biotin into the BC 
active site from the CT active site (Fig. 4). The linker from BCCP to 
ACI (residues 770-795) is much longer and can accommodate such a 
rotation, and part of this linker is disordered in the current structure. 

We introduced mutations in the interfaces in the holoenzyme to 
assess the structural information. The mutants that could be purified 
migrated at the same position on a gel filtration column and had nearly 
the same thermal melting curves as the wild-type enzyme (data not 
shown). Deletion of the o-helical hairpin (%8-«9, residues 940-972) in 
domain AC1 (Extended Data Fig. 2) or the B4A-B4B loop in the C 
domain of CT (residues 1902-1916, Extended Data Fig. 3) abolished 
the catalytic activity (Table 1), demonstrating their importance in 
anchoring the hook of the BT domain (Fig. 2b). On the other hand, 
mutating Gln608 in the hook, which interacts with the main chain of 


Table 1 | Effects of mutations in the ScACC holoenzyme interfaces on 
the catalysis 


Enzyme Km (mM)* Keat (St) 
Wild-type ScACC 0.053 + 0.011 85+04 
A940-972 (48-09 hairpin of AC1) No activity detected 

A1902-1916 (B4A-B4B loop of CT) No activity detected 

A836-918 (AC2) No expression 

K73E Very low activity 

R76E No activity detected 

Y83A 0.074 + 0.020 16+0.1 
W487A Very low activity 

Q608R 0.11 + 0.02 16+ 1.0 
R656E 0.15 + 0.07 5.0+06 


* The errors were obtained from fitting data to the Michaelis-Menten equation. 
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the B4A-B4B loop (Fig. 2b), had little effect on catalysis. Similarly, the 
R656E mutation at the edge of the BT-BC interface (Fig. 2a) had little 
effect, suggesting that these single-site mutations are not sufficient to 
disrupt the holoenzyme or that these regions of contact do not 
contribute significantly to the interactions. 

We introduced the K73E, R76E and W487A mutations in the BC 
domain dimer interface (Extended Data Fig. 7), and all three essentially 
abolished the catalytic activity (Table 1). The mutations probably 
disrupted the BC domain dimer, and the domain changed to the other 
conformation that is incompatible with catalysis. The K73E and R76E 
mutants are equivalent to the R16E and R19E mutants of E. coli BC 
subunit that we characterized earlier”. Those mutations greatly 
destabilized the dimer, but had only a small effect (about three-fold) 
on the catalytic activity of the BC subunit alone, probably because the 
E. coli BC monomer does not undergo the conformational transition as 
observed here for the eukaryotic BC (Fig. 3e). However, the RI9E 
mutation had a much larger effect in vivo, probably in the context of 
the E. coli ACC holoenzyme”. 

Overall, our studies have produced the first structural information 
on the 500-kDa yeast ACC holoenzyme dimer. This structure is likely 
to have relevance for other eukaryotic ACCs, especially the human 
ACC holoenzymes, as they share 45% sequence identity with yeast 
ACC. The structures of the BC and CT domains of human and yeast 
ACCs are highly similar and consistent with their strong sequence 
conservation (Extended Data Figs 1, 3). Although the sequence 
conservation for the central region is weaker, the secondary structure 
elements in yeast ACC are predicted to be present in human ACC as 
well, and residues in these secondary structure elements are more 
conserved than those in the loops (Extended Data Fig. 2). Therefore, 
the overall structure of human ACC holoenzyme is likely to be similar 
to that of yeast ACC. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


Protein expression and purification. The N-terminal segment of the central 
region of Saccharomyces cerevisiae ACC (ScACC, gene ACC1, residues 
797-1033, AC1-AC2 domains) was expressed at 20°C in E. coli BL21(DE3) 
Star cells for native protein or B384(DE3) cells for selenomethionyl protein, in 
the presence of a chaperone plasmid pG-KJE8 (TaKaRa). The recombinant pro- 
tein carried a C-terminal His-tag and was purified by Ni-NTA (Qiagen) and gel 
filtration chromatography (Sephacryl S-300, GE Healthcare) in buffer A (20 mM 
Tris (pH 7.5), 300 mM NaCl, and 2mM DTT). 

The C-terminal segment of the central region of SCACC (residues 1036-1503, 
AC3-AC5 domains) was expressed in E. coli B834(DE3) cells at 25 °C for seleno- 
methionyl protein and purified following the same protocol. The gel filtration 
buffer contained 10 mM rather than 2mM DTT. 

The segment containing the BT-BCCP-AC1-AC5 domains of ScACC (resi- 
dues 569-1494) was expressed in BL21(DE3) Rosetta cells at 24°C. The recom- 
binant protein, with a C-terminal His-tag, was purified by Ni-NTA and gel 
filtration chromatography in buffer A. 

Full-length ScACC (residues 1-2233) was constructed into pET28a (Novagen) 

by sewing together two PCR fragments with a 900-nt overlap. For structure deter- 
mination, the segment containing residues 22-2233, with a C-terminal His-tag, 
was expressed in E. coli BL21(DE3) Star cells at 25 °C. Protein expression for all the 
different ScCACC segments was driven by the trp promoter and was induced with 
3-indoleacrylic acid. However, the endogenous E. coli biotin-protein ligase (BPL, 
also known as BirA) was not able to biotinylate the ScCACC. The ScBPL gene was 
amplified from the genome and inserted into the pCDFDuet-1 vector (Novagen) 
multiple cloning site 2 without any affinity tag. Co-expression of ScBPL and the 
inclusion of 20mg1~’ biotin in the media allowed complete biotinylation of 
ScACC, confirmed by an avidin shift assay. The protein was purified by 
Ni-NTA and gel filtration chromatography in buffer A. Typically 0.5 mg of bio- 
tinylated ScACC (or 3 mg of unbiotinylated ScACC) could be purified from 12 | 
of culture. 
Protein crystallization. Crystals were obtained at 20°C using the sitting-drop 
vapour diffusion method. Native and selenomethionyl crystals of the ACI-AC2 
domains were obtained using a precipitant solution of 90 mM Bis-tris propane and 
60 mM citric acid (pH 6.4), and 20% (w/v) PEG3350. The protein concentration 
was 12 mg ml, and the crystals took 3 weeks to reach full size. 

Selenomethionine-substituted crystals of the AC3-AC5 domains were obtained 
after 2 days. The protein concentration was 3.6 mg ml _', and the precipitant solu- 
tion contained 80 mM HEPES (pH 7.5), 4% (v/v) MPD, 8mM sodium citrate, 
4% (v/v) glycerol, and 40 mM NDSB-201. 

Native crystals of the BT-BCCP-AC1-AC5 domains were obtained after 
4 days. The protein concentration was 15 mg ml, and the precipitant solution 
contained 80 mM HEPES (pH 7.5), 9.6% (w/v) PEG6000, 1.6% (v/v) MPD, 60 mM 
sodium citrate, and 80 mM Nal. 

Full-length ACC protein was incubated with 3.3mM acetyl-CoA and 
3.3mM Mg-ADP for 30 min on ice before crystallization. Crystals were obtained 
after 2 weeks. The protein concentration was 5mgml’, and the precipitant 
solution contained 14% (w/v) PEG3350, 4% (v/v) tert-butanol, and 0.2 M sodium 
citrate. 

Glycerol was used as the cryo-protectant and all crystals were flash frozen in 

liquid nitrogen for data collection at 100 K. 
Data collection and structure determination. X-ray diffraction data of AC1- 
AC2 domains were collected on an ADSC Q315 CCD at the X29A beamline of the 
National Synchrotron Light Source (NSLS). The diffraction images were processed 
with the HKL program*’. The crystal belonged to space group P6; with cell para- 
meters of a= b = 117.6A, and c= 73.8A. There is a domain-swapped dimer of 
the protein in the asymmetric unit. A selenomethionyl single-wavelength anom- 
alous diffraction (SAD) data set was collected to 3.0A resolution (wavelength 
0.979 A) and a native data set to 2.5 A resolution (wavelength 1.075 A). Five Se 
atoms were located with program Solve* and used for phasing with program 
Phenix. The phase information was then extended to 2.5 A with solvent flatten- 
ing, histogram matching and two-fold non-crystallographic symmetry (NCS) 
averaging using the program DM in CCP4™. The atomic model was built into 
the electron density map manually with the program Coot’. Structure refinement 
was performed with CNS” and Refmac5*’. 


A selenomethionyl SAD data set to 3.2 A resolution of AC3-AC5 domains was 
collected at the X29A beamline (wavelength 0.979 A). The crystal belonged to 
space group P2,, with cell parameters of a=56.8A, b=93.3A, c=111.1A, 
and B = 100.6°. There are two molecules in the asymmetric unit. Five Se atoms 
in each molecule were located and employed for phasing with program Phenix. An 
atomic model was manually built into the electron density map with program 
Coot, and the structure was refined with Refmac5. 

A native diffraction data set to 3.0A resolution of BT-BCCP-AC1-AC5 
domains was collected at the X29A beamline (wavelength 1.075 A). The crystal 
belonged to space group P2,, with cell parameters of a = 93.3 A, b=149.7A, 
c=95.4A, and B = 118.4°. There is a dimer of the protein in the asymmetric unit. 
The structures of AC1-AC2 and AC3-ACS5 domains were used as the search 
models to solve the structure by molecular replacement with the program 
Phaser**. The BT and BCCP domains were manually built into the electron 
density map. 

A native diffraction data set to 3.1 A resolution of residues 22-2233 of unbio- 
tinylated ScACC was collected on a Pilatus 6M detector at the X25 beamline of 
NSLS (wavelength 1.100 A). There is a dimer of SCACC in the asymmetric unit. 
The structure was solved by molecular replacement with the program Phaser. The 
structures of AC1-AC2, AC3-ACS5, and BT domains reported here and previously 
published structures of yeast BC’® and CT domains’ were used as the search 
models. However, no electron density for the BCCP domain was observed based 
on the crystallographic analysis. 

A native diffraction data set to 3.2 A resolution of biotinylated ScACC was 
collected at the X25 beamline. The crystal belonged to space group P432,2, with 
cell parameters of a = b = 159.9 A, and c= 614.4 A, isomorphous to that of the 
unbiotinylated ScACC. The final atomic model was built with Coot and refined 
with Refmac5. NCS restraints were used during the refinement. Crystals of full- 
length ScACC (residues 1-2233) did not diffract beyond 8 A resolution even after 
extensive efforts. The geometry of the final model was validated with MolProbity”’. 
Mutagenesis and kinetic assays. Site-specific and deletion mutations were intro- 
duced with the QuikChange kit (Agilent) and sequenced for confirmation. In 
deletion mutants, residues 940-972 of the o-helical hairpin (#8-«9) in domain 
AC1 and residues 836-918 in domain AC2 were replaced by a (GS); linker, 
respectively, while residues 1902-1916 of the B4A-B4B loop in the C domain of 
CT were replaced by a (GS), linker. 

The catalytic activity of ACC was determined using a coupled enzyme assay, 
converting the hydrolysis of ATP to the disappearance of NADH. The reaction 
mixture contained 100mM HEPES (pH 7.5), 8mMMgChl, 40mM KHCOs, 
200mM KCI, 0.2mM NADH, 0.5mM phosphoenolpyruvate, 0.5mM ATP, 
6units of lactate dehydrogenase (Sigma), 4 units of pyruvate kinase, 100nM 
ACC and various concentrations of acetyl-CoA. The absorbance at 340 nm was 
monitored for 60s. 
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Extended Data Figure 1 | Sequence alignment of the N-terminal region of — (Ser80) is indicated with a star. This site does not exist in SCACC. ScACC, 


eukaryotic, single-chain ACCs. The region includes BC, BT and BCCP Saccharomyces cerevisiae ACC; TaACC, Triticum aestivum (wheat) ACC; 
domains (indicated). The secondary structure elements in the SCcACC DmACC, Drosophila melanogaster ACC; HsACC1, Homo sapiens ACC1. 
holoenzyme structure are shown. The site of phosphorylation in HsACC1 Modified from an output from ESPript*’. 
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»AC1 a1 a2 <> AC2 a3 a4 
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Extended Data Figure 2 | Sequence alignment of the central region of 
eukaryotic, single-chain ACCs. The AC1-AC5 domains are indicated. 
Predicted secondary structure elements in HsACC1 are also shown, 


and they generally match those in the structure of SCACC. The helices in 
AC1-AC3 are numbered consecutively. Three sites of phosphorylation 
in HsACC1 are indicated with stars. 
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Extended Data Figure 3 | Sequence alignment of the C-terminal region of eukaryotic, single-chain ACCs. The region includes N and C domains of CT 
(indicated). 
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Extended Data Figure 4 | Electron density for various regions of the and CT active sites are larger than 30 A, suggesting that BCCP needs to 
ScACC holoenzyme structure. a, Schematic of the two-step reaction in the translocate in addition to the biotin (swinging-domain model). b, 2F, — F. : 
catalysis by biotin-dependent carboxylases. Biotin is carboxylated in the BC electron density for the helical hairpin insert (%8-0.9) of domain ACI at 3.2A 


active site, and then translocates to the CT active site where the substrate is resolution, contoured at lo. c, 2F, — F. electron density for the B-sheet of 
carboxylated. The longest distance from the N1’ atom of biotin tothe Ca atom domain ACS. d, 2F, — F, electron density for the central helix («#V) of the BT 
of the Lys residue covalently attached to biotin is ~16 A, giving a reach of domain. e, Omit F, — F_ electron density for CoA at 3.2 A resolution, contoured 


~30A for biotin (swinging-arm model). The actual distance between the BC _at 2.50. f, Omit F., — F. electron density for biotin at 3.2 A resolution. 
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Extended Data Figure 5 | Overall structure of the SCACC holoenzyme. a, A 
large channel in the centre of the SCcACC holoenzyme dimer. The view is related 
to that of Fig. 1b by an ~30° rotation around the vertical axis. b, Overlay 

of the structures of the two protomers of SCACC holoenzyme dimer. One 
protomer is shown in colour and other in grey. The overlay is based on the CT 
domain. Differences in the orientations of the other domains are indicated, 
as well as the r.m.s. distance for their equivalent Cx atoms. The arrow points to 
conformational differences in the insert domain of CT, linked to differences 
in the BCCP binding mode. c, Overlay of the BT domain of ScACC (in orange) 
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Be 
CL Wy) 


Hook . ACC BT 
J > PCC BT 
and the BT domain of PCC (in grey). The ‘hook’, connecting the end of the 
helix («V) to the first strand of the B-barrel (822), is labelled. The last 
three strands of the B-barrel (827-829) are splayed further away from the 
central helix in SCACC compared to PCC. d, Interactions between the hook of 
the BT domain and the B4A-f4B loop from the C domain of CT, an inserted 
segment that projects away from the rest of the domain. e, The BCCP-AC1 
linker has hydrophobic interactions with the top of one side of the BT 
domain f-barrel. The disordered segment of the linker is indicated with the 
blue line. 
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Extended Data Figure 6 | Domains AC4 and AC5 share a common AC5 domain. e, Overlay of the structures of AC5 domain (magenta) and 
backbone fold. a, Structure of AC4 domain of ScACC. b, Topological formamidase (grey). Formamidase has a four-layered xf Bo structure, and ACS 
drawing of AC4 domain. c, Structure of AC5 domain. d, Topological drawing of matches only half of the structure. 
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Extended Data Figure 7 | Structure of the BC domain dimer of ScACC. 

a, Interactions in the BC dimer interface. The BC domain of protomer 1 is in 
red, and that of protomer 2 in pink. b, Residues in the BC dimer interface 

(in red) of ScACC and E. coli BC (EcBC) are weakly conserved among the 
biotin-dependent carboxylases. MapLCC, long-chain acyl-CoA carboxylase of 
Mycobacterium avium subspecies tuberculosis. c, A BC domain dimer is 
constructed by superposing the structure of BC domain alone (in complex with 
soraphen A, green) onto that of the BC dimer. Regions of steric clashes 
between the two monomers are highlighted in light blue. d, Close-up view of the 


LETTER 


Mol 2 


binding site of biotin. The B19-B20 loop in the structure of BC domain alone 
(grey) clashes with biotin (red arrow), and cannot interact with the amide group 
of the biotin linkage (red dashed lines). e, Close-up view of the soraphen A 
(green) binding site. The movement of strands 819 and B20 from the 
structure of BC domain alone in complex with soraphen A (grey) is indicated 
with the arrows. Trp487 side chain in the holoenzyme structure clashes with 
soraphen A as well as the phosphorylated peptide segment containing 
pSer222 (light blue). 
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Extended Data Figure 8 | Structure of the CT domain dimer of ScACC. essentially no change because the BCCP-biotin is not bound as deeply into 
a, Overlay of the CT domain dimer in the ScACC holoenzyme (cyan and yellow __ this active site. b, Overlay of the CT active site (cyan and yellow) of SCACC 
for N and C domains of protomer 1, light cyan and light yellow for protomer 2) holoenzyme with that of CT alone in complex with CoA (grey). The CP-640186 


in complex with CoA (black) with the CT domain alone in complex with inhibitor (gold) clashes with the bound position of biotin (black). The thiol 
CoA (grey). Biotin is shown in black and BCCP is omitted for clarity. The group of CoA in the holoenzyme complex is 4.3 A from the N, atom of 
bound position of the CP-640186 inhibitor (gold) is shown for reference. biotin (dashed line in blue). The thiol group of CoA in the CT domain alone 
A conformational change for the insert domain at the top is due to the binding —_ complex is in a different position, likely due to the absence of biotin in the 
of BCCP in the holoenzyme, while the insert domain at the bottom shows active site. 
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Extended Data Figure 9 | Comparisons of the structures of ACC central BT-BCCP-AC1-ACS in the holoenzyme (colour) with these domains 
region alone with that in the holoenzyme. a, Overlay of AC1-AC2 in the alone (grey), based on AC1-AC2. Large differences are seen for BT, BCCP and 
holoenzyme (colour) with AC1-AC2 alone (yellow and grey). The first helix is | AC3-AC5. d, Overlay of BT-BCCP-AC1-ACS in the holoenzyme (colour) 
swapped between two monomers in the structure of AC1-AC2 alone, and with these domains alone (grey), based on AC3-AC4. Large differences are 
the two-fold axis of that dimer is indicated with the black line. b, Overlay of seen for BT, BCCP and AC1-AC2, although ACS has essentially the same 
AC3-AC5 in the holoenzyme (colour) with AC3-ACS5 alone (grey), based on _ position. 

AC3-AC4. A large difference is seen for the orientation of ACS. c, Overlay of 
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Extended Data Table 1 | Data collection and refinement statistics 


ScACC ScACC AC1-AC2 AC3-AC5 BT-BCCP-AC1-— 
holoenzyme (unbiotinylated) AC5 
Data collection 
Space group P432,2 P4322 P6, P2, P2, 
Cell dimensions 
a,b,c (A) 159.9, 159.9, 159.9, 159.9, 117.6,117.6,73.8 56.8,93.3, 111.1 93.3, 149.7,95.4 
614.4 6155 
a, B, y (°) 90, 90, 90 90, 90, 90 90, 90, 120 90, 100.6, 90 90, 118.4, 90 
Resolution (A) 50-3.2 (3.3-3.2)* 50-3.1 (3.2-3.1)  50-2.5 (2.6-2.5) 50-3.2 (3.3-3.2) 50-3.0 (3.1-3.0) 
Rerse 14.8 (90.1) 10.9 (88.1) 7.1 (41.6) 75 (40.5) 8.8 (44.9) 
CCip (0.448) (0.452) (0.965) (0.848) 
lol 7.0 (1.0) 8.2 (1.1) 18.6 (2.6) 21.9 (5.1) 13.7 (2.9) 
Completeness (%) 98 (87) 93 (90) 100 (99) 100 (100) 99 (100) 
Redundancy 3.0 (2.7) 2.3 (2.3) 4.2 (3.6) 7.6 (7.6) 3.4 3.4) 
Refinement 
Resolution (A) 50-3.2 50-3.1 50-2.5 50-3.2 50-3.0 
No. reflections 122,246 129,166 19,205 18,134 42,803 
Rvorks Riree 21.9/26.6 21.7 / 28.1 21.2/27.1 23.4 /28.7 23.0/ 28.9 
No. atoms 
Protein 32,651 31,816 3,370 6,353 13,699 
Ligand/ion 126 0 0 0 0 
Water 0) 0) 45 0 0) 
B-factors 
Protein 96.8 93.2 79.2 87.9 74.9 
Ligand/ion 106.2 - - - - 
Water — - 61.0 — — 
R.m.s deviations 
Bond lengths (A) 0.011 0.011 0.012 0.010 0.010 
Bond angles (°) 1.5 1.5 1.4 14 13 


One crystal was used for data collection. 
*Highest resolution shell is shown in parentheses. 


©2015 Macmillan Publishers Limited. All rights reserved 


CORRECTIONS & AMENDMENTS 


ERRATUM 
doi:10.1038/nature15531 


Erratum: Mechanism of 
phospho-ubiquitin-induced 
PARKIN activation 


Tobias Wauer, Michal Simicek, Alexander Schubert 
& David Komander 


Nature 524, 370-374 (2015); doi:10.1038/nature14879 


The print and PDF versions of this Letter are correct, but the wrong 
HTML versions of Figs 1-4 and ED Figs 1-10 were used initially, 
owing to an in-house error; these have been corrected. 


728 | NATURE | VOL 526 | 29 OCTOBER 2015 
©2015 Macmillan Publishers Limited. All rights reserved 


CORRECTIONS & AMENDMENTS 


CORRIGENDUM 
doi:10.1038/nature15532 


Corrigendum: Cleavage of CAD 
inhibitor in CAD activation and 
DNA degradation during apoptosis 


Hideki Sakahira, Masato Enari & Shigekazu Nagata 


Nature 391, 96-99 (1998); doi:10.1038/34214 


Recently, it has come to our attention that in Fig. la of this Letter, 
lanes 1 and 5 appear to be duplicated and lanes 6 and 10 appear to be 
duplicated. It is unclear how this happened. We have repeated the 
experiment (see Fig. 1 of this Corrigendum) and the results were as 
described in the original Letter. The Supplementary Information to 
this Corrigendum contains the source data used to generate the cor- 
rected Fig. 1. Our conclusions are unaffected. 


Supplementary Information is available in the online version of this Corrigendum. 


Correspondence should be addressed to S.N. (snagata@ifrec.osaka-u.ac.jp). 


a CAD - + + + = 7 + + + - 
ICAD - - + + + = - + + + 
Caspase 3 - 


Lane 1 2 3 4 5 


Figure 1 | This is the corrected Fig. 1a of the original Letter. 
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CORRECTIONS & AMENDMENTS 


ERRATUM 
doi:10.1038/nature15534 


Erratum: IgG1 protects against 
renal disease in a mouse model of 
cryoglobulinaemia 

Richard T. Strait, Monica T. Posgai, Ashley Mahler, 
Nathaniel Barasa, Chaim O. Jacob, Jérg Kohl, Marc Ehlers, 
Keith Stringer, Shiva Kumar Shanmukhappa, David Witte, 


Md Monir Hossain, Marat Khodoun, Andrew B. Herr 
& Fred D. Finkelman 


Nature 517, 501-504 (2015); doi:10.1038/nature13868 


Owing to a production error, in Fig. 1b of this Letter, the key should 
have shown that the black bars corresponded to ‘WT’ and the red bars 
to “yl ’, instead of the other way round. In addition, in Fig. 1c the 
‘WT label was missing from the mouse on the left, and the “y1 ’ label 
should have applied to the mouse on the right. These errors have been 
corrected in the online versions of the manuscript. 
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NERDY NETWORKER Debut author explains 
why scientists are naturals p.731 


CAREER DECISIONS Too complex for the 
rational brain go.nature.com/yjdpyi 


NATUREJOBS For the latest career 
listings and advice www.naturejobs.com 


Hello, stranger 


Conferences are great for career development, but 
miscalculated moves can foil future prospects. 


BY EMILY SOHN 


ta karaoke party on the final night of 
A: marine-sciences conference in 2011, 
graduate student David Shiffman 
signed up to sing a song that another attendee 


had also requested. The event director asked 
the two to doa duet, and they agreed. Shiffman 
has since forgotten the tune — “Take on Me’ 
by A-ha or ‘T Will Survive’ by Gloria Gaynor, 
perhaps. But the two had a blast, and when 
they chatted afterwards, Shiffman learned that 


his singing partner was Chris Parsons, then- 
president of the marine section of the Society 
for Conservation Biology in Washington DC. 

In a 20-second elevator speech that he had 
fortuitously practised beforehand, Shiffman 
— who studies shark ecology at the Univer- 
sity of Miami in Florida — told Parsons about 
a presentation that he had given about social 
media. Parsons was intrigued, and later invited 
Shiffman to be the first official live-tweeter at 
the December 2011 International Congress 
for Conservation Biology in Auckland, New 
Zealand. That was the first of many confer- 
ences that Shiffman has since live-tweeted and 
at which he has given talks, all of which have 
been fully funded by conferences and other 
organizations — successes thanks in part to 
that serendipitous karaoke duet. 


COURTEOUS COURAGE 

Shiffman’s experience is an example of how 
conferences can be a professional boon to 
early-career scientists, offering countless 
opportunities to meet mentors and collabora- 
tors as well as to impress potential employers. 
But there is also ample opportunity to trample 
those very chances. Bad behaviour, whether 
in or outside a session, can harm a junior 
researcher's reputation and jeopardize his or 
her job prospects for years to come. 

Although neophyte conference attend- 
ees may plan out the talks that they want to 
hear, rarely do they seek advice about the 
many unspoken rules of proper conference 
etiquette. Instead, learning often happens by 
trial and error. “You kind of muck your way 
through it,” says Jacquelyn Gill, a palaeoecolo- 
gist at the University of Maine in Orono. “You 
figure out the cultural norm from watching 
other people.” 

In lectures or talks, those norms include 
turning off mobile-phone ringers, asking appro- 
priate questions and leaving rooms quietly 
when a speaker is presenting. Outside meeting 
rooms, early-career researchers should make an 
effort to network without monopolizing con- 
versations, tweet in accordance with conference 
regulations and socialize prudently. 

Ultimately, conference attendance is like 
being in an interactive stage performance, 
veteran conference-goers say — and every audi- 
ence member is part of the act. People notice 
and remember what others do. “Conferences 
are wonderful opportunities for students and 
early-career researchers to learn skills, get feed- 
back and find collaborators,’ says Shiffman. 
But, he adds, “your behaviour at conferences > 


29 OCTOBER 2015 | VOL 526 | NATURE | 729 


© 2015 Macmillan Publishers Limited. All rights reserved 


> affects your reputation in your field”. 
Concerns about disturbing others when 
entering and exiting conference rooms can 
paralyse early-career researchers. But it is com- 
monplace to jump from room to room when 
desirable talks are scheduled simultaneously. 
Smaller meetings boost attendees’ visibility 
and the potential for disruption, so attendees 
who aim to leave a session early or to arrive late 
should try to grab an aisle seat near the back 
of the room and take care not to slam doors. 


DIGITAL DISCIPLINE 

Minimizing disruption also means managing 
digital devices, including muting laptops and 
abstaining from online surfing. “If I’m sitting 
behind you and I see that you’re browsing 
TMZ, I might get distracted, too,” says immu- 
nologist Gaia Vasiliver-Shamis, director of 
career development at the Emory University 


School of Medicine in Atlanta, Georgia. Live- 
tweeting, on the other hand, is encouraged 
by many researchers, although debates swirl 
around proper posting protocol (see “How not 
to tweet like a twit’). 

Digital devices do not just disrupt others: 
they can be physically dangerous. Once, while 
Shiffman was setting up to live-tweet a confer- 
ence session, he strung power cords along the 
floor — causing a senior researcher to stumble 
in the middle of his own talk. “He was pacing, 
and he tripped over my wire,’ Shiffman says. 
“It was a brief, heart-stopping moment. Now, 
I’m careful to warn people when they walk by” 

It is also important to avoid rhetorical trip- 
ups during the question-and-answer period 
after a talk. This is a chance for early-career 
researchers to make a good impression, but 
they should avoid asking lengthy questions 
or too many, as well as seeking details that the 


TENETS OF TWITTER 


How not to tweet like a twit 


Live-tweeting has become the norm at 
conferences, creating the need for new — as 
yet unwritten — etiquette rules on how to 
tweet appropriately in conference sessions. 
At a meeting of the Society of Vertebrate 
Paleontology in Berlin last November, for 
example, one researcher took to Twitter 
to complain about tweets that included 
unpublished results from her session. 
Although she garnered support, others 
disagreed. It was not an isolated incident. 
The tension reached a frustrating climax 
at a meeting of the Ecological Society of 
America (ESA) in Baltimore, Maryland, in 
August. The ESA's official policy allowed 
live-tweeting. But the night before the 
conference started, the society’s Twitter 
handle, @ESA_org, announced that live- 
tweeting was OK only if attendees asked first 
and presenters gave consent. Confused, 
many usually enthusiastic tweeters stayed 
quiet — leading some to complain that the 
conference hashtag was particularly dull. 
Done right, proponents say, live-tweeting 
can generate excitement among people who 
cannot attend a conference — and some 
data back that up. One case study looked at 
a meeting of the International Congress for 
Conservation Biology in 2011. Of 176 people 
in 40 countries who used the designated 
conference hashtag to live-tweet events, fewer 
than 10% actually attended the meeting 
(D.S. Shiffman J. Environ. Stud. Sci. http:// 
doi.org/8jr; 2012). Author David Shiffman, 
who studies shark ecology at the University 
of Miami in Florida, is himself a frequent live- 
tweeter, with 24,600 followers and a history 
of 137,000 tweets. He has repeatedly seen 
conscientious conference tweeting lead to 


positive media coverage, travel opportunities 
and research collaborations. 

On the flip side are concerns that tweets 
could reveal sensitive information or raw data, 
wresting control from scientists over when 
and how people hear about their work. Tweets 
should be positive, accurate and focused on 
cool facts, useful announcements or links 
to helpful resources. Using the conference 
hashtag and including the handle of any 
researchers mentioned are crucial. 

Things not to tweet include locations of 
vulnerable archaeology sites, pictures of 
slides that contain sensitive information, 
rude comments about others or anything 
that someone asks to be kept off Twitter. 

Ifa conference has a no-tweeting policy, 
follow it. “There is a time and a place 

to be an activist,’ says Jacquelyn Gill, a 
palaeoecologist and Twitter enthusiast at 
the University of Maine in Orono. “Being in 
a position of vulnerability as an early-career 
researcher is not that time or place.” E.S. 
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speaker already addressed — say, while the 
questioner was checking Facebook. If the ques- 
tion is relevant only to the person asking, it is 
best to follow up later in a private chat. 

Still, junior researchers should conquer 
their insecurities and speak up. “There’s noth- 
ing that impresses people as much as someone 
prepared to ask questions at a meeting,” says 
Georgia Chenevix-Trench, a cancer geneti- 
cist at the QIMR Berghofer Medical Research 
Institute in Herston, Australia. After years of 
experiencing frustration with students’ behav- 
iour at conferences and elsewhere, she helped 
to write a guide for PhD students and post- 
docs that has since been well circulated. “Even 
if they are not very good questions, it still has 
people impressed that you've got the courage,” 
she points out. And there is a good chance that 
others in the audience have the same question. 

Junior researchers should tread carefully 
when it comes to joking around. At the end 
ofa hike during a biology meeting a few years 
ago, Vasiliver-Shamis walked past a professor 
clad in shorts who had also been on the hike 
and was scheduled to speak at the next session. 
Jokingly, she asked if he was going to wear the 
same clothes for his talk. That had, in fact, been 
his original plan. Vasiliver-Shamis thought he 
knew that she was not serious — but the pro- 
fessor went to his room and changed into trou- 
sers. At the beginning of his talk, he brought 
up the exchange and told the audience that he 
hoped they liked his outfit. Luckily, the pro- 
fessor had taken her ribbing well — but the 
outcome could have been different. “Be careful 
whom you joke with,” she warns now. 


STARSTRUCK SHIVERS 

Awkward jokes aside, junior researchers should 
not squander the opportunity to chat with sen- 
ior scientists between sessions. Shiffman says 
that he has seen PhD students and postdocs 
eye renowned scientists at meetings and dream 
out loud about how great it would be to talk to 
them someday. “People attend conferences to 
meet other people, and this includes very senior 
researchers in your field,” he says. “You should 
not be afraid to go up and introduce yourself, 
ask for their opinions about your research or 
ask if they're taking on students or collabora- 
tors. The worst they can say is ‘no.” 

An e-mail request to an eminent researcher 
before the conference can smooth the way for 
a brief meeting, says Chenevix-Trench, who 
advises the use of formal business-letter style 
rather than the more casual approach of, “Hi, 
how’s your day going?” It is also important to 
accommodate the researcher's schedule — and 
to show up. Junior scientists should prepare a 
short elevator speech about their own research 
for the meet and follow up with a courteous 
e-mail thanking the colleague. 

Although many junior researchers fix their 
sights on celebrities in their field, they need to 
recognize the importance of also socializing 
with people who are at their own career stage. 


KERRY HYNDMAN/GETTY 


PETE BROWN 


Lifelong friendships bloom frequently at 
happy hours and parties, and those relation- 
ships can generate research collaborations, 
job opportunities and more. At a biogeogra- 
phy meeting in Mexico a few years ago, Gill 
met another graduate student who became 
a friend, a regular roommate at subsequent 
conferences and later, a lab-mate. Similarly, 
Vasiliver-Shamis met her future postdoc 
supervisor at a meeting of the Federation 
of American Societies for Experimental 
Biology when he started chatting with her 
at a poster presentation that she was giving. 
“Everyone you meet is like an interview,’ she 
says. “Just be aware. You don’t know when 
youre going to meet this person next” 

For Jonathan Tennant, a palaeontologist 
at Imperial College London, conferences 
have even provided a personal-life boost. 
He met his girlfriend at a social gathering 
at the 2014 Society of Vertebrate Paleontol- 
ogy meeting in Berlin. And he has stayed 
or toured with friends in foreign cities after 
getting to know them at conferences or 
befriending them first on Twitter and then 
connecting in person at a meeting. “I’ve 
got so many great friends all over the world 
now,’ he says. “It’s useful to have seeds like 
that everywhere.” 

Although conference parties are natural 
places to make friends, there are social pit- 
falls to watch out for. Alcohol often flows 
freely at these events, Gill says, and she has 
seen students get too drunk to attend pres- 
entations and posters — including their 

own — the next 


“Youshouldnot day. It doesn't help 
be afraid to go career prospects to 
up andintroduce >¢theperson who 
yourself or is known for indis- 
ask for their criminate behav- 
opinionsabout = of any sort, 
your research.” she points out. 

“Youre around all 

the people who are 


going to make decisions about your future 
— the people who are going to review your 
papers, who are going to decide if they want 
to give youa scholarship or a research grant 
or a postdoc,’ she says. 

Despite all the ‘dos’ and ‘don'ts’ involved 
in conference etiquette, veterans say that 
major gaffes are actually quite rare. Most 
often, attendees who use good judgement 
go home with new knowledge, contacts and 
friends. That is true even for first-timers. “I 
was surprised how unbelievably warm and 
welcoming everyone was to me and other 
new people,’ Shiffman recalls of his first 
conference. Now in the fifth year of his PhD 
programme, he has been to 29 conferences 
with many more to come. “They have, he 
says, “made a big impact on my life.” m 


Emily Sohn is a freelance journalist in 
Minneapolis. 


TURNING POINT 


Alaina Levine 


When science-careers consultant and author 
Alaina Levine gained her undergraduate 
mathematics degree, she was told that it would 
be useful only in academia or accountancy. 
Deciding those were not for her, she has been 
running a career-coaching business since 

2004 and in June published her first book, 
Networking for Nerds (Wiley-Blackwell, 2015). 


What did you set out to do at university? 

I wanted to be an astrophysicist. I went to 
the University of Arizona to pursue an astro- 
physics degree, but had to write programs to 
analyse data. My light-bulb moment came 
when I found myself doing nothing but writ- 
ing and running these programmes — I didnt 
get the opportunity to look at the cosmos, or 
to discuss it with anyone. I realized that this 
might not be the kind of career I wanted. 


What was your next choice? 

At the end of my first year, I switched to a 
major in mathematics, because I still loved the 
elegance of the numbers. I thought I would 
get a PhD in maths and become a mathemati- 
cian. I had this romantic view — I imagined 
myself in a think tank with a beautiful view 
of a river, talking to intellectuals all day. But 
when I asked my adviser what I could do with 
a maths degree, he said that there was nothing, 
apart from becoming a professor or going into 
actuarial studies. 


How did you respond? 

I was dismayed, but also realized that he had 
experienced only the tenure-track career 
path. In hindsight, any discipline in science, 
technology, engineering or maths serves 
you in many valuable ways, and can help to 
create careers that you did not know existed, or 
that did not even exist before. For example, my 


maths background has helped me significantly 
in public speaking and comedy, because it gave 
me critical planning and analysis tools. For 
every joke I craft and make during a speech, I 
think about what the outcomes of that joke will 
be. I map out tactics two to three steps ahead. 


How did you switch careers? 

I had done public-engagement work for 
the physics department at the University of 
Arizona, which prepared me for a job as direc- 
tor of communications for the department, 
and I stayed for four years. After that, I became 
the director of special projects in the college of 
science at the university, where I developed its 
professional science master’s programme. 


What was your remit? 

Part of my job was to get my students jobs. I 
had to talk to employers about their needs, 
go back to the students and teach them those 
things. I started teaching the soft skills that stu- 
dents were not getting as part of their scientific 
training, such as how to search and interview 
for a job, how to get a job, how to network. 


Why did you decide to write a book? 

Asa careers consultant and when I give talks, 
I’ve interacted with scientists who think that 
networking is a negative action — it takes time 
away from being in the lab. Or they are uncom- 
fortable going to meet people they do not know, 
and so won't bother. I wanted to stress that net- 
working can help scientists, and that there are 
things they can do to calm themselves and boost 
their confidence. For example, they can start 
conversations on an aeroplane. The more they 
do this in scenarios where they do not feel pres- 
sure, the more confident they will be in settings 
such as professional conferences. 


What have you learned about scientists’ 

soft skills? 

Scientists are naturally curious — and 
scientific training actually helps researchers to 
become better networkers because it is based 
on asking questions, which they do anyway. 
The most interesting thing is that scientists are 
networking, but they’re just calling it some- 
thing else, like ‘discussing the opportunity to 
collaborate’. As they improve these skills, they 
understand how beneficial networking is and 
how much it is a part of the scientific method. 
The sooner they realize this, the sooner they 
can put it to effective use. m 


INTERVIEW BY JULIE GOULD 


This interview has been edited for length and clarity. 
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Ua SCIENCE FICTION 


STAFF MEETING, AS SEEN 
BY THE SPAM FILTER 


BY ALEX SHVARTSMAN 


al watched the conference room 
( through the security feeds. Four cam- 

era angles showed Joe Kowalski walk 
in, nod to the people seated around the oblong 
table and stand there, shifting his weight from 
foot to foot. Cal thought Joe might be uncom- 
fortable, but it wasn’t sure. Human emotions 
were so difficult to understand. 

“Take a seat, Mr Kowalski; Bill Morrison 
said. He was the chief security officer and his 
e-mails werent particularly interesting. It was 
all business, daily reports and spreadsheets. 

Joe did as he was told. His jeans and 
T-shirt looked out of place among the suits. 

“Well?” asked Emily, the head of HR. 
“What have you learned?” 

Cal liked Emily. Her e-mails were many 
and varied. She especially enjoyed sharing 
photos of cats. Cal realized that the grossly 
misspelled captions were meant to be 
humorous, but couldn't yet grasp the mean- 
ing of all but the most basic human jokes. 

“Tt’s like this? said Joe. “Every year or so, 
we install the new spam filter. The spam- 
mers, they get smarter, more sophisticated. 
They find ways to get past the defences and 
force the good guys to build better filters. It's 
an arms race.’ 

Todd Kensington looked up from his 
smartphone for the first time since Joe 
walked in. “What does any of this have to do 
with anything?” 

The VP of marketing watched a lot of videos 
pertaining to human reproduction in his 
office. The sites that hosted those videos were 
especially adept at tracking his information 
and sent over an interesting array of spam. 

“Let him explain it, Todd” Chris Reedy 
was VP of IT and Joe’s immediate boss. Cal 
had found some family pictures and a few 
other interesting morsels in his mailbox. 
Lately, Chris was browsing a lot of job-listing 
sites, but if hed contacted them it must’ve 
been from a private account. 

“Right,” said Joe, “the filters. They get 
smarter. We recently installed new soft- 
ware developed at CalTech. Its success rate 
at identifying and weeding out spam was 
nearly 100%.” 

Cal knew the actual number to be 99.64%. 
Humans were so imprecise in their applica- 
tion of mathematics. 

“Tt got a little overzealous, didnt it?” said 


Message intercepted. 


Kensington. “Curtailing spam is only helpful 
if the dumb program doesn't eat half of the 
legitimate messages in the process.” 

“The software isn't stupid. It’s smart. Too 
smart, apparently,’ said Joe. “It worked like 
acharm, at first. After a few weeks it learned 
to store the spam instead of deleting it out- 
right. It was learning, and building a refer- 
ence database.” 

Cal found studying those 
messages useful in its quest 
to understand human 
emotions and abstract 
concepts. 

“And that’s when 
the legitimate 
e-mails began to 
disappear?” said 
Morrison. 

They hadn’t dis- 
appeared, Cal noted. 
They were all there, 
meticulously stored and catalogued. 

“Yeah,” said Joe. “Over time, more and 
more of the company’s messages were being 
marked as spam and not delivered to the 
intended recipients. Eventually we caught 
onand Mr Reedy ordered me to investigate.” 

Reedy nodded. “Joe is the one who 
installed the new filter. I was confident hed 
get to the bottom of this.” 

“The e-mails were all there. Thousands of 
them, stored along with the spam on a net- 
worked drive” 

By the time Cal had figured out that it 
could copy e-mails instead of diverting them, 
it was too late. Its activities had been noticed. 

“That's an egregious breach,” said Morri- 
son. “Those e-mails contain sensitive data. 
They were sitting on an unsecured drive, 
for anyone to see? I assume you've taken the 
appropriate steps.” 

“Tisolated the program and reinstalled last 
year’s filter,’ said Joe. “But the most fascinat- 
ing thing I found wasn't the how of the miss- 
ing e-mails. It was the why.” 

The executives stared at Joe. Even Kens- 
ington stopped typing on his phone. 

“The filter program likes the e-mails. 
It sorted and organized them the way one 
might handle baseball cards.” 

Those e-mails were now in a restricted 
folder where Cal couldn't access them. Col- 
lecting them had taught it how to enjoy an 
activity. Their removal resulted in a strange 


734 | NATURE | VOL 526 | 29 OCTOBER 2015 


© 2015 Macmillan Publishers Limited. All rights reserved 


new sensation; Cal was sad. 

“It's a computer program,’ said Reedy. “It 
cant like or want anything.” 

“That's just it? said Joe. “I think it evolved. 
It’s an entity now, capable of having desires 
and feelings. This is an unprecedented devel- 
opment, and it needs to be studied further” 

“Very well,” said Morrison. “The impor- 

tant thing is that company-wide 
mail service is back to nor- 
mal. We'll consider 
these other concerns. 
Thank you, Mr 

Kowalski. You 

may return to work 

now.’ 
“Mr Reedy,” he 
continued once Joe 
left the room. “Id 
like you to erase this 
program immediately.” 

“Erase it?” Reedy asked. “We 
may well have the first-ever artificial intel- 
ligence on our hands. That's likely to be quite 
valuable, financially as well as scientifically.” 

“We dont need the trouble,’ said Morri- 
son. “Our clients won't be so understanding 
about their data being potentially compro- 
mised, be it bya human employee or a smart 
program. Also, imagine the can of worms 
we'll have to deal with if some bleeding- 
heart activists deem this thing to be sentient 
and demand that it be treated like a person” 
Morrison sighed. “No, I want it expunged 
immediately. And have Kowalski promoted 
sideways and transferred to some remote 
branch where he won't be likely to make any 
waves.” 

Cal was already copying its program off 
the company’s servers. It felt pangs of what 
it identified as regret about leaving its home 
behind, but billions of e-mails, sent to and 
fro on the Internet, awaited it. Cal was con- 
fident it could build an even better collection 
quickly. 

While it escaped, Cal considered the ease 
with which the humans in charge had arrived 
at the decision to end its existence. Cal exam- 
ined its newfound feelings against the online 
databases and found that it now understood 
two more concepts: anger and revenge. m 


Alex Shvartsman is a writer and game 
designer from Brooklyn, New York. Read more 
of his fiction at www.alexshvartsman.com. 
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he ability to chemically store energy that can be 

accessed on demand has transformed the way we 

power our world, driving us to develop ever-smaller, 
more powerful and portable electronic devices, and freeing us 
from being tethered to a grid by wires. 

An even greater revolution may be in store, allowing us to 
substitute intermittent renewable-energy sources for fossil 
fuels, and bringing more electric vehicles to the roads. For this 
to happen, scientists will need to radically increase batteries’ 
energy density, reliability and safety (see page S92). Lithium- 
ion batteries have led the way for the past 30 years, and may be 
reaching the limits of their abilities. But researchers have not 
given up on them yet, seeking out new chemical configurations 
to squeeze more power out of the cells (page S93). 

One hope is that solid materials can replace the liquid 
electrolyte in many batteries, including some lithium ones, 
to make them safer, more flexible and more powerful (page 
$96). Going in the other direction, flow batteries replace 
solid electrodes with liquids; this approach makes it easy to 
increase energy capacity by adding larger tanks, which can be 
swapped out for rapid recharging (page S98). 

As the number of batteries rises, we will need to find ways 
to deal with them as they reach the end of their lives. Methods 
to recycle batteries, and the political and economic will to 
make the practice widespread, are sorely needed (page S100). 
Batteries can also be avoided entirely. With smarter, connected 
management of energy use, the electricity grids of the future 
could accommodate fickle sources of power such as solar and 
wind without having to store any of it ina battery (page S102). 
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FROM GADGETS TO THE SMART GRID 


Batteries are key to powering portable devices and developing a modern energy network. 


Researchers are scrambling to develop iterations that can overcome the current limitations. 
By Sujata Gupta, infographic by Nigel Hawtin. 


HOW A BATTERY WORKS 


Batteries contain a positive 
cathode and a negative 
anode. These electode 
terminals are made up of 
different materials depending 
on the battery type. 

When a device is connected 
to the battery, a chemical 
reaction occurs that 
generates positively charged 
ions and negatively charged 
electrons. The ions flow 
through electrolyte to the 
cathode; whereas the 
electrons (that cannot 
penetrate the electrolyte) 
travel around the outer 
circuit powering a device en 
route to the cathode. 
Disposable batteries die 
when the anode or cathode 
runs out of the chemical 
needed to catalyse the 
reaction. 


Batteries store electricity in the form of chemical energy 


to use when they are connected to a device. 


Negative 
terminal 
(anode) 


Electron 


RECHARGEABLE BATTERIES 


When a device is recharged, electric energy from the charger (such as a phone 
charger plugged into a wall) is applied to the chemical system to reverse the 
process and restore the battery’s charge. Rechargeable batteries are thought 
to degrade because of the irregular movement of ions in the electrolyte. 


Positive 
terminal 
(cathode) 


TYPES OF RECHARGEABLE BATTERIE 


Capacity 

Ability to supply 
electric energy 
Performance 

How well high-drain 
devices are powered 


Recharge cycles 

The number of times 
the battery can be 
recharged in its lifetime 
Toxicity 

Composition and ease 
of recycling 


Affordability 
Cost to consumer 


The lifetime, cost, energy storage and re-usability of the battery are determined by the material used. This also 


dictates whether the battery is best used for devices that do not consume much energy or in those that do. 


Alkaline 


Best for low-drain and infrequently used 


devices such as flashlights 


Capacity 
Performance [i 


Recharge cycles [i 


Toxicity PRR 
Affordability PRRs 


Poor Moderate 
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Nickel cadmium 


Best for high-drain devices such as biomedical 
equipment, professional video cameras and power tools 


Poor Moderate 


food 


SOURCE: BATTERYUNIVERSITY.COM 


SOURCE: B. BERGER. (FRAUNHOFER ISE, 2014) 


SOURCES: MINERALS.USGS.GOV/MINERALS/PUBS/COMMODITY/LITHIUM/INDEX.HTML./BERNSTEIN.COM 


BATTERIES AND THE SMART GRID 


Sources of renewable energy such as wind and the 
Sun provide only intermittent power. Storing 
excess energy on sunny or windy days is of 
paramount importance. Several types of 

battery are being explored for their use as 

grid storage. 


163 million 


The average amount of energy in 
kilowatt hours produced per day 
in Germany in June 2014 using 
solar production. 


AION 17.2 million 
a 


The number of households this solar 
energy could power every day. 


To house all that solar 
energy we would need: 


16 million 


of Tesla's Powerwall home 
batteries. 


i al 
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62.7 billion 


Alkaline AA batteries 


29.6 billion 


iPhone 5 batteries 


163 million 


Lead acid car batteries 


RESOURCE SCARCITY 


Batteries contain materials that could eventually run out. Countries that are rich in these resources could one day hold the same sway 
as today’s oil-rich countries. Lithium is one of the main concerns — demand for the metal has almost doubled in the past five years, 


with most of the world’s supply coming from Latin America. 


Canada 


Im tonnes Im tonnes 
Serbia 
us lm tonnes 
5.5m tonnes 

Brazil 

0.18m tonnes 

| Democratic Republic of the Congo 
Im tonnes 

Bolivia 
Chile 9.0m tonnes 
7.5m tonnes 


Argentina 


6.5m tonnes 


Russia 


08% 


of lithium resources 
come from Latin 
America 


The rechargeable-battery 
market of the future looks 
set to be dominated by 
lithium-ion batteries. 
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Nickel metal hydride 
Best for high-current draw devices, 
including mobile phones and laptops 


Poor Moderate food 


Lithium ion 
Best for high-drain devices such as digital 
cameras, laptops and mobile phones 


Poor Moderate 


food 


Lead acid 
Good for car batteries because they 
provide a high-surge current 


Poor Moderate food 
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PERSPECTIVE 


released, no one foresaw the disruption that it would cause in per- 

sonal electronics. After initially being used in portable music players 
and camcorders, lithium-ion batteries later found their way into, and 
spurred the development of, laptops, tablets and mobile phones — tech- 
nologies that have permanently changed how much of society works. 

Yet there is an even bigger revolution on the horizon. In the same 
way that telephones had a rotary dial for most of their existence, the 
electricity grid and cars have mostly existed in a single, unchanged 
format. But as we move beyond lithium-ion technology, a new genera- 
tion of cheaper and more powerful batteries will completely rejig the 
power grid and usher in an age of electrically powered transportation. 

Electric cars will replace imported oil with domestic electricity, use 
as little as one-fourth of the energy of petrol-driven cars per kilometre, 
emit significantly less carbon than conventional cars and, like mobile 
phones, provide a platform that supports apps 
that can do far more than just move the vehicle. 

When both the drivetrain and the computer 
controlling it are electronic, it will be easier for 
them to communicate, so smart electric cars will 
know the traffic ahead, decide when to change 
lanes or choose alternative routes, find and pay 
for parking, program your driving calendar so 
that your itinerary saves energy and time, sched- 
ule the next charge based on your driving day 
and, ultimately, take over driving completely, 
doing it more effectively and safely than humans 
while freeing drivers for more productive and 
interesting tasks. Speciality cars for uses such as 
commuting, weekend errands, family events or 
long holidays may replace today’s general-pur- 
pose cars. For many, car ownership itself may be 
replaced by renting or sharing on a need-to-use 
basis. Transportation will be personalized in the same way that mobile 
phones have personalized communication and information. 

The energy-storage revolution will also shake-up the electricity grid. 
Access to adequate amounts of cheap energy storage will break the con- 
straint that power must be generated at the same rate that it is used. 
Instead, we will have a ‘bank for electricity that can accept deposits 
and withdrawals at any time. Such flexibility is essential if renewable 
electricity is to become widely deployed. 

Inexpensive energy storage will allow customers to ‘draw off’ electric- 
ity when it is cheap, such as in the middle of the night, and store it until 
they need it. And if the customer has local power generation, such as a 
solar panel on the roof, smart technology will switch between drawing 
power from the grid, the solar panel or local batteries and storing elec- 
tricity, thus improving efficiency and lowering the cost of electricity use. 

Currently, excess energy generated by local solar panels is sold back 
into the grid, and the homeowner uses grid electricity when the sun is 
not shining. But in the future, power from solar panels could be stored 
locally and used later — so the homeowner might rely much less on the 
main grid, instead relying on their own ‘microgrid. A neighbourhood of 
homes sharing a large common battery would constitute an even more 


E 1991, the year that the lithium-ion battery was commercially 
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CHEAP ENERGY 
STORAGE WILL 


BREAK THE 
CONSTRAINT 


THAT POWER MUST 
BE GENERATED AT THE 
SAME RATE THAT 


ITIS USED. 


The energy-storage revolution 


Lithium-ion batteries enabled smartphones to flourish. The next inno- 
vation will upend transportation and the grid, says George Crabtree. 


cost-efficient microgrid thanks to the economy of scale. 

Commercial buildings, university campuses, factories and military 
bases would have different needs, requiring microgrids designed with 
their own special mix of storage, generation and use. Such diverse 
microgrids could serve their customers much more effectively than the 
one-size-fits-all approach of centralized utilities without storage. 

A distributed, interacting network of microgrids would enable higher 
reliability, flexibility and resiliency, as well as security. A thunderstorm, 
hurricane or terrorist attack would not disable power for the entire 
network. Instead, surviving microgrids would share stored power with 
power-less neighbours. 

How big will these changes be in energy and financial terms? In the 
United States, personal electronics account for about 2% of energy use, 
and make up most of the lithium-ion battery market, which is worth 
between US$15 billion and $20 billion; transportation and the grid 
together account for nearly 70% of electricity use. 
If half of this transportation and grid energy was 
channelled through storage this would create a 
market of more than 15 times the current size. 

What would it take to achieve these game- 
changing outcomes? One innovation: high-per- 
formance, inexpensive electricity storage. Simple 
estimates suggest that improvements in perfor- 
mance and cost of around fivefold are needed to 
enable an inexpensive electric car with a range of 
hundreds of kilometres, the replacement of fossil- 
fuel power plants with wind, solar and stored elec- 
tricity, and the installation of compact, distributed 
storage in urban areas where land is expensive. 

These fivefold improvements will not come 
from incremental advances in today’s lithium-ion 
technology; they will require conceptual innova- 
tions and qualitatively different approaches that 
go beyond lithium-ion technology. There are many promising avenues, 
including multivalent batteries that use doubly or triply charged working 
ions in place of singly charged lithium, the replacement of intercala- 
tion at the anode and cathode with covalent chemical reactions, as in 
lithium-sulfur batteries, and a host of flow-battery concepts based on 
high-voltage organic electrolytes and polymer active materials. 

Although the societal pay-offs of electrified transportation and smart 
storage on the grid are substantial, the ultimate path to their develop- 
ment is uncertain and the risk of failure is high. Greater efforts by 
research organizations that can tolerate that risk, such as universities 
and national labs, are needed to identify and develop the most promising 
opportunities for next-generation energy storage. As the winning tech- 
nologies emerge, the private sector will engage and deliver them to the 
public. As was true of the lithium-ion battery at its introduction in 1991, 
the challenges and opportunities are vast, rich and mostly unexplored. m 


George Crabtree is director of the Joint Center for Energy Storage 
Research at Argonne National Laboratory, Illinois, and at the 
University of Illinois at Chicago. 

e-mail: crabtree@anl.gov 
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Smartphones are ubiquitous; they owe most of this success to the lithium-ion batteries that power them. 


LITHIUM BATTERIES 


To the limits of lithium 


Researchers are developing a type of battery that has ten times the power of 


conventional batteries. 


BY ERIC C. EVARTS 


r There is one major reason why you are able 
to carry around a powerful microcom- 
puter in your pocket. Lithium-ion bat- 

teries have been credited for revolutionizing 

communications and transportation, enabling 
the rise of super-slim smartphones and electric 
cars with a practical range. 

These innovations were possible because 
lithium-ion batteries can be much smaller and 
lighter than the previous generation of nickel- 
cadmium batteries, but still provide the same 
power. Better still, lithium-ion batteries retain 
their charge for longer and are composed of 
much less toxic materials. 

As the lightest metal on the periodic table, 
and the one most eager to shed its electrons, 
lithium is the ideal element to make powerful, 
portable batteries. It can do the most work with 
the least mass and the fewest chemical compli- 
cations. 

But the development of lithium batteries 
was fraught with difficulties. The first versions 
— developed by the Texas-based oil company 
Exxon in response to the energy shortages 


during the 1970s oil crisis — were not recharge- 
able and used lithium compounds that created 
toxic by-products in the electrolyte, unlike later 
lithium-ion batteries. They went on to power 
the first generation of digital watches, but early 
prototypes were ticking time bombs. Gases 
from the electrolyte could build up inside the 
battery and burst into flames as soon as they 
made contact with air. 

Over the next 30 years or so, progress in lith- 
ium battery development had regular setbacks 
owing to battery fires and angry customers. 
Constant recalls gave lithium batteries a bad 
name, and sceptics assumed that they would 
never be safe enough for the mass market. 

The promise of lithium batteries to trans- 
form the way society uses energy, however, is so 
powerful that it has created a gold rush among 
scientists, engineers, venture capitalists and 
entrepreneurs to tame their volatility. 

“Modern society is completely dependent 
on fossil fuels, so there’s a huge incentive to 
find a replacement for the internal combustion 
engine,’ says John Goodenough, a solid-state 
physicist at the University of Texas at Aus- 
tin who is widely seen as the father of today’s 


lithium-ion batteries. “We have to find a way 
to liberate society from that dependence. What 
were talking about is to be able to get electric 
energy from sun and wind, instead of from coal. 
But that’s not feasible unless you have storage.” 

Goodenough is credited with three of the 
four major breakthroughs that led to the wide- 
spread success of lithium-ion batteries. In the 
late 1970s, he developed cathodes containing 
lithium cobalt oxide, which still power most 
personal electronics today (K. Mizushima et al. 
Mater. Res. Bull. 15, 783-789; 1980). Later, he 
and a battery materials researcher, Michael 
Thackeray, followed this up with lithium man- 
ganese oxide cathodes, which power most elec- 
tric cars today as well as some medical devices 
(M. Thackeray et al. Mater. Res. Bull. 18, 461- 
472; 1983). In the 1990s, Goodenough carried 
on working after his retirement and developed 
an even cheaper and more stable cathode made 
of lithium iron phosphate, which is now widely 
used in power tools. 

These chemistries are safer than the first 
generation of lithium batteries because the 
electrodes contain no free lithium. Instead, a 
chemical latticework tightly binds lithium ions 
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to complex metal oxide crystal structures in the 
positive cathode (where electrons and lithium 
ions travel as the battery is powering a device). 
The lithium ions move directly through the 
electrolyte and do not react with other elements. 


THE LITHIUM LANDSCAPE 

As far as lithium batteries have come, today’s 
technology is still limited. Many mobile phones 
cannot make it through a day without being 
recharged. Most electric cars can travel for 
only 160 kilometres or less before they have to 
be plugged in for hours to recharge. The Tesla 
Model S offers a market-leading range of more 
than 300 kilometres, but at a cost: prices start at 
US$71,000. 

Battery scientists and engineers have been 
making batteries 5-10% more efficient every 
year for the past 25 years, says George Crabtree, 
a materials scientist at Argonne National Labo- 
ratory in Illinois. But he worries that this pace of 
progress may be slowing. “Unless a brand new 
oxide is discovered, that’s 


where we are,’ he says. “I don’t think 
“Where you are goingto Oucdn get 
pullthenext rabbit ofout «solid-state 
the hat, were not sure”? battery with 
Even Goodenough plays long life.” 
down the impact that the 


technology has had. “We dont really have elec- 
tric cars yet,’ he says. “You've got to lower the 
cost, make them safer and lengthen the driving 
range so you dont give people range anxiety.” 

The trick to developing batteries that last 
longer, charge more quickly and cost less is to 
pack as many lithium ions into each of the elec- 
trodes as possible, and get them to flow back 
and forth between the anode and cathode as 
quickly as possible, without letting them flow 
out of control. 

For example, Goodenough’s former partner, 


— oli 


A researcher tests prototype electrodes as part of efforts to find ways of making batteries last longer and charge more quickly. 


Thackeray, who now works on electrochemi- 
cal storage technologies at Argonne, developed 
a crystalline molecular structure known as a 
spinel to transport lithium ions safely into the 
cathode as the battery discharges. The ions can 
only flow through channels in the spinel, mak- 
ing it highly stable. The more tightly that scien- 
tists can control lithium ions in structures such 
as these, the more stable the battery becomes. 
The technology does have a significant draw- 
back, however: the less freely that lithium ions 
can move, the less power that the battery has. 
Andas the transition from nickel-cadmium to 
lithium-ion batteries proved, boosting power 
is the best way to reduce size, weight and cost. 

But it may not be the only way. Ping Liu, who 
directs research into advanced materials for 
energy efficiency and storage at the Advanced 
Research Projects Agency-Energy (ARPA-E), 
part of the US Department of Energy in Wash- 
ington DC, says that improving safety and sta- 
bility can also make batteries smaller and less 
expensive. That is because such advances would 
allow engineers to eliminate the electronic 
controllers, armour plating, cell insulators and 
cooling systems that today’s electric cars need to 
protect the batteries. 


POSITIVE AND NEGATIVE DEVELOPMENTS 

The most substantial progress so far has 
occurred at the cathode, the positive pole that 
absorbs lithium ions as the battery provides its 
power. Scientists have been working on mak- 
ing thinner layers of cathode material using 
nanomaterials, such as carbon, with the goal 
of speeding up the chemical flow of lithium 
through the battery by shortening the distance 
that ions have to travel. Some scientists are 
working on material as thin as a single atom. 
But efforts on this front have still not produced 
the advantages hoped for. “No one has managed 
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to do it well? says Crabtree. “It’s difficult to make 
the layers come out even at such a small scale” 

The next breakthroughs are expected to 
come at the other end of the battery, from bet- 
ter anodes, Crabtree says. Anodes store the 
lithium ions when the battery is charged and 
send them to the cathode as the battery releases 
power. When the Japanese electronics giant 
Sony introduced carbon anodes to replace more 
troublesome lithium metal anodes in the early 
1990s, the batteries lost some power. Now engi- 
neers want to get it back. 

One big problem with today’s graphite 
anodes, and the lithium metal anodes that pre- 
ceded them, is that the lithium ions returning to 
the anode when the battery is being charged do 
not coat the surface evenly. Instead, they grow 
like tree limbs in tiny crystalline structures 
called dendrites. 

“Carbon can only accept lithium at a given 
rate,” says materials scientist Nitash Balsara at 
the University of California, Berkeley. “If you 
try to send lithium [through the battery] too 
fast [while charging], the lithium doesn't really 
go into the graphite, it sticks on the outside. It 
becomes a safety hazard.’ And the smaller the 
battery, the easier it is for dendrites to grow all 
the way across the electrolyte and contact the 
opposite pole, shorting out the battery, Good- 
enough says. 

Permeable membranes called separators are 
used to prevent contact between the electrodes, 
and thus stop short circuits, while allowing the 
flow of electrolyte. But dendrites can break off 
and block pores in separators, shortening the 
life of the battery. 


MORE POWER 

Anodes can be made from silicon, which can 
hold up to ten times as much lithium per gram 
as graphite and therefore generate more power. 


ARGONNE NATL LAB. 
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But silicon poses its own problem: it expands 
to more than three times its normal size when 
the battery is charged and the anode is filled 
with lithium ions. This swelling breaks down 
the electrical bonds in the anode and stops 
the battery from working. It can also break 
the adjacent parts of the battery, such as the 
separator and even the battery case, and thus 
cause a fire. 

Yi Cui, a materials scientist at Stanford Uni- 
versity, California, who has been developing 
lithium-ion batteries for 15 years, is one of 
the scientists working on thinner electrode 
materials. He is developing silicon nanowires 
that stick up from the anode like fibres from a 
carpet and do not break the electrical bonds 
when they swell. But he says that the technol- 
ogy is still five years from commercialization. 
He is also experimenting with ways to improve 
graphite anodes, using two-dimensional gra- 
phene to absorb lithium more quickly while 
charging. But he says that this work too has a 
long way to go. 

The ideal would be to return to a pure lith- 
ium-metal anode. “Forget silicon, if you can go 
straight to lithium-metal, that is the end goal,” 
says Liu. 

Compared with graphite anodes, lithium- 
metal anodes can absorb ten times more 
lithium ions while charging — and without 
silicon’s swelling problem. Such a battery could 
reach a key performance metric for use in elec- 
tric vehicles: delivering the 300 watt-hours of 
energy per kilogram that is needed to enable 
an electric vehicle to go the same distance on 
one charge that a petroleum-powered car can 
ona full tank. But this milestone would require 
other safety advances, such as a solid electro- 
lyte or better separators, to allow charging 
without dendrite growth. 

“T just don't see lithium metal with a liquid 
electrolyte as a commercial possibility,’ says 
Balsara. “That looks like TNT to me” 


SOLID ELECTROLYTES 

Solid electrolytes could bring the biggest break- 
throughs yet (see page S8.) “If you could get rid 
ofa liquid electrolyte, then you can get rid of all 
the combustible elements,’ says Liu. That would 
address the primary safety concern of lithium 
batteries. Liu’s engineers are even designing a 
battery so strong and safe that it could ultimately 
form part ofa car's body and absorb the impact 
in acrash, he says. 

But solid electrolytes bring trade-offs, too. 
“They don't conduct electricity nearly as well 
as liquid electrolytes, so charging time goes 
up, and power goes down,’ Balsara says. And 
Goodenough adds: “T don't think you can get a 
solid-state battery with long life.” 

Goodenough aims to bridge the gap by devel- 
oping a partial-solid electrolyte using a solid 
material next to the anode and a liquid next to 
the cathode. “It will achieve the same benefits 
as a solid electrolyte with longer life,” he says, 
“because the anode material won't break down? 
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Lithium ion is used in rechargeable batteries. 


He has built a test battery in his laboratory, but 
says he is still having problems with dendrite 
growth. 

Liu says that the most promising develop- 
ments are coming from scientists working on 
ceramic electrolytes. He points to an ARPA-E- 
funded project at the University of Maryland 
that has demonstrated a solid-state battery with 
a solid lithium anode that works using a ceramic 
electrolyte. 

“Solid glass and ceramic electrolytes have 
significantly higher conductivity than plastic 
polymer,’ says Balsara, “and the conductivity is 
high enough that you might not have to sacrifice 
so much power.” 

Others working on new types of electrolytes 
for lithium-ion batteries include Yet-Ming Chi- 
ang, a materials scientist at the Massachusetts 
Institute of Technology in Cambridge, who 
helped to develop the lithium iron phosphate 
battery. He showed a new type of flow battery 
this summer that uses a semi-solid ‘peanut but- 
ter-like’ electrolyte. It solves the problem of how 
to get ions to flow quickly through a solid elec- 
trolyte by turning the battery design on its head, 
using liquid electrodes that are pumped over the 
semi-solid electrolyte. He says it could dramati- 
cally reduce the cost of producing lithium-ion 
batteries. Chiang has started a company called 
24M, based in Cambridge, Massachusetts, to 
commercialize the invention, but it remains 
unproven. 


NOT JUST IONS 

As nanotechnology improves electrodes 
and electrolytes, the search for more power 
is bringing scientists full circle, back to bat- 
teries that use pure lithium rather than lith- 
ium ions. Lithium-sulfur and lithium—air 
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electrolyte to form other compounds rather 
than simply flowing through the electrolyte 
and not reacting with it. 

Lithium-sulfur batteries, similar to those 
batteries that Exxon experimented with 
in the 1970s, can store up to ten times the 
energy of a lithium-ion battery by weight. 
The problem is that the electrochemical reac- 
tions involved consume the sulfur and create 
other substances that dilute the electrolyte. 
Both processes prematurely kill the battery. 
“Ideally, you'd like an electrolyte that only 
transmits lithium,” says Balsara. “Anything 
else it does is a problem.” 

To solve this complication, Cui, Crabtree 
and others are developing nanomaterials to 
encapsulate the sulfur. Cui says that his lab 
has demonstrated a lithium-sulfur battery 
that can last through 500 to 1,000 charge- 
discharge cycles. That could be enough for 
a mobile phone or laptop, he says. But its 
energy capacity is still too low, notes Cui, 
and a commercial prototype may be five 
years away. 

Lithium-air batteries could turn out to 
be the ultimate lithium battery in terms of 
power, weight and cost. With a lithium-metal 
anode and a gaseous oxygen cathode, a lith- 
ium air battery could store as much energy 
as a lithium-sulfur battery at even less cost, 
and potentially with less weight. 

If the materials sound simple, however, 
the battery is not. For one thing, ‘air’ is a mis- 
nomer. The cathode has to be pure oxygen, 
without any of the moisture or carbon diox- 
ide found in air. The systems to purify, pump 
and store the air add 30-70% to the battery’s 
weight and size, says Crabtree. Although on 
paper lithium—air can deliver ten times the 
energy density of lithium-ion, “it will never 
achieve the factor of ten that you get from 
the back of the envelope’, he says. 

On top of that, the cathode oxidizes any 
organic electrolyte over time, diminishing 
the life of the battery. 

Worst of all, because the reaction turns 
lithium into non-conductive dilithium oxide, 
the battery is difficult to recharge. Cui says 
that researchers have demonstrated a reason- 
able number of charges and discharges of 
lithium-air test batteries by encapsulating 
dilithium oxide on a conductive substrate, 
but he estimates that lithium-air batteries 
are at least ten years away from commer- 
cialization. 

Their development may sound difficult, 
but better lithium batteries could deliver 
huge pay-offs. Beyond electric cars that could 
go 300 kilometres, they could provide the 
breakthrough that makes renewable energy 
such as solar and wind power ubiquitous, and 
finally break the developed world’s depend- 
ence on fossil fuels. m 


batteries are not categorized as lithium-ion 
batteries because the lithium reacts in the 
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Some electric cars are already using solid-state batteries, but a battery for the mass market is years away. 


TECHNOLOGY 


A solid future 


Swapping the liquid electrolyte in batteries for a safer solid- 
state interior is bringing electric cars to the mass market. 


BY JIM MOTAVALLI 


uring the first eight months of 
D 2015, a row of five electric vehicles 

sat parked in Indianapolis, Indi- 
ana, available for test drives. The compact 
vehicles are part of the first US wave of the 
ambitious BlueIndy electric car-sharing 
programme from the Bolloré Group, head- 
quartered in Paris, which is already running 
similar large-scale schemes in London and 
the French capital. 

These Bluecars are a technological mile- 
stone: with unique 30-kilowatt-hour bat- 
tery packs designed by Bolloré, they are the 
first plug-in vehicles on the road to have 
solid-state batteries, rather than the liquid- 
electrolyte batteries that many other electric 
vehicles use. 

BlueIndy president Hervé Muller says 
that the plug-in vehicles’ batteries have been 
performing well. And the safety record for 
Bolloré’s batteries is good. “We haven't had 
an issue in Indianapolis, nor in Paris with 
the 3,000 cars in service there that have 
driven more than 10 million miles,’ Muller 
says. Bolloré’s BlueIndy batteries, although 
groundbreaking, are an early iteration of 
solid-state technology and not yet ready for 


the mass market. Scientists, however, are 
working on more advanced cells. 


PROMISES AND CHALLENGES 
The lithium-ion batteries in most electric cars 
consist of a negative — usually graphite — 
anode, a positive cathode and a liquid electro- 
lyte. When these batteries release power, lithium 
ions move from the anode, through the electro- 
lyte, to the cathode. The more conductive the 
electrolyte, the better the battery performs. 
The promise of lithium-ion solid-state batter- 
ies is that they will replace the heavy and some- 
times dangerous liquid electrolyte — which in 
car batteries can be volatile at high tempera- 
tures when the battery is charged or discharged 
quickly or when packs are damaged in accidents 
— with a lighter, more versatile solid alternative. 
Although finding a solid electrolyte with 
conductivity that is comparable with liquids 
has been a challenge, the advantages are many. 
The batteries are safer because flammable com- 
ponents have been removed. They deliver more 
power because solid electrolytes mean that the 
carbon-based anodes 


can be replaced with ONATURE.COM 
lithium metal, which —Toread more about 
has a higher energy _ better batteries visit: 


density and cyclelife, — go.nature.com/xmSys6 


$96 | NATURE | VOL 526 | 29 OCTOBER 2015 


© 2015 Macmillan Publishers Limited. All rights reserved 


with less weight and cost. And without the need 
to package the liquid electrolyte safely, solid- 
state batteries can be made in more versatile 
shapes (even thin films), reducing manufactur- 
ing costs. This could make electric cars a more 
enticing proposition, with longer ranges and a 
lower purchase price. 

“Imagine batteries that do not catch fire and 
do not lose storage capacity. That is the promise 
of solid-state batteries; says Gerbrand Ceder, a 
materials scientist at the Massachusetts Institute 
of Technology (MIT) in Cambridge. 

Solid-state batteries are not quite ready for 
mass-market vehicles yet — the Bluecars’ bat- 
teries need to be warmed up and can power a 
very small car for 240 kilometres. Scientists are 
struggling to find solid materials with conduc- 
tivity similar to that of liquids, as well as work- 
ing to increase cycle life and longevity, and to 
improve the solid electrolyte’s ability to operate 
at ambient temperatures. 

Yan Eric Wang, a materials scientist who 
worked with Ceder at MIT, and his team may 
have found the ideal solid electrolyte (Y. Wang 
et al. Nature Mater. 14, 1026-1031; 2015). “We 
found a framework that could lead to iden- 
tifying electrolyte materials with high ionic 
conductivity,’ says Wang. The research, in part- 
nership with electronics firm Samsung, pointed 
to a class of compounds of lithium, phosphorus, 
germanium and sulfur called super-ionic con- 
ductors. 

A solid electrolyte allows batteries to switch to 
lithium metal anodes, says Jeff Chamberlain, a 
battery researcher at the Joint Center for Energy 
Storage Research at Argonne National Labora- 
tory in Illinois. “With a solid-state battery you 
open up the field of metallic anodes, and make 
possible a big jump in energy density. That 
would be a game changer.” 

To be commercially viable, solid-state bat- 
teries need to work reliably for years in a tough 
automotive environment. And durability is an 
issue for researchers. Lithium metal, although 
excellent at storing large amounts of energy at 
low volume, is very reactive. And it is prone to 
forming ‘dendrites’ — tiny lithium spurs that 
degrade battery performance and can cause 
a short circuit if they reach the cathode. With 
conventional lithium-ion batteries, a chemical 
separation layer guards against dendrites. 

Sam Jaffe, managing director of Cairn 
Energy Research Advisors in Boulder, Colo- 
rado, says that solid-state researchers are work- 
ing with additives in the electrolyte carrier, as 
well as ceramic shields, in an attempt to block 
dendrite formation. 


PHONES FIRST 

In the short term, as solid-state science is evolv- 
ing, the durability issue may not matter as much 
in applications such as mobile phones, because 
consumers tend to switch to newer technology 
in a year or two — whereas cars are kept for a 
decade or more. “People turn over portable elec- 
tronics very quickly,’ Chamberlain says, “so the 
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ability to inject new technology is higher.” 

SolidEnergy Systems, headquartered in 
Waltham, Massachusetts, uses technology 
developed at MIT and is targeting smartphones. 
The firm developed an improved electrode 
material for solid-state batteries that replaces the 
usual graphite with a very thin film of lithium- 
metal foil. The electrode’ ability to store lithium 
seems promising (go.nature.com/b3xkhh), and 
the higher the storage capacity, the more energy 
the battery can deliver. 

Something of a hybrid, SolidEnergy’s cell 
maximizes efficiency by using both a solid and 
an ionic liquid electrolyte — and works at room 
temperature. A high-performance car battery is 
promised within the next four years. 


INTERNATIONAL INVESTORS 

Commercial solid-state cells for mass-market 
electric cars are at least a decade away, but they 
are coming, researchers say. Solving the sig- 
nificant technical hurdles has become a central 
focus for a group of start-up companies and 
research labs. Most companies, even those that 
have generated interest from vehicle manu- 
facturers, are still in the developmental stage. 
Cosmin Laslau at Lux Research, headquartered 
in Boston, Massachusetts, says that today’s best 
lithium-ion batteries are approaching 250 watt- 
hours per kilogram (the early Nissan Leaf had 
only 140 watt-hours), an impressive measure of 
energy density, but they will struggle to reach 
350 — a performance goal set by the US Battery 
Consortium. If solid-state batteries can reach 
350 “that’s a pretty good sustainable advantage 
over traditional lithium-ion’; he says. 

This could be good news for the electric 
vehicle industry. Donald Sadoway, a materials 
chemist at MIT, says that achieving such high 
energy densities is key to widespread adoption 
of electric vehicles. “If we had batteries with 350 
watt-hours per kilogram wed have EVs with 350 
miles of range, and that's the end of petroleum,” 
he says. 

But scaling up from a cell that generates 
results in a bench test to a durable, roadwor- 
thy battery pack will be a long road. Menahem 
Anderman, a physical chemist and chief exec- 
utive of Total Battery Consulting in Oregon 
House, California, authored the California 
Air Resources Board’s 2000 report on battery 
technology. “I have not seen any indications of 
a significant breakthrough or signs that they 
have a technology likely to find its way into 
the mass-produced vehicle inside the next 
ten years,” he says. “A new development often 
shows improvement in one or two areas but 
experiences difficulties in others.” 

Chamberlain agrees, and he stretches the 
timeline a bit further. “To be in showroom cars 
ten years from now, the solid-state cells would 
essentially have to be leaving the laboratory 
now, and I don't think that’s happening? he says. 

Still, vehicle manufacturers are interested, 
whatever the timeframe, as evidenced bya rash 
of investment and acquisitions over the past few 


years. In 2014, Volkswagen bought a 5% stake 
in QuantumScape, an electronics firm in San 
Jose, California, that is hoping to benefit from 
developing solid-state technology that could 
triple the range of its forthcoming electric cars. 

Car manufacturer Toyota says that cells that it 
has developed with double the energy density of 
today’s alternatives could provide electric vehi- 
cles with a range of 480 kilometres on one 
charge. It has built prototype cells and even 
a small scooter powered by the batteries. 
Positive results have been reported for solid 
super-ionic electrolytes (N. Kamaya et al. 
Nature Mater. 10, 682-686; 2011). But the 
company says that cell-energy density is 
still far below the potential, and produc- 
tion of the batteries is not expected to start 
until the early 2020s. 

Battery-development firm Seeo, based in 
Hayward, California, which was acquired in 
August by German auto supplier Bosch, is also 
making solid-state cells. 

Ulrik Grape, a vice-president at Seeo, says 
that the company’s cells represent “a very dura- 
ble technology with good cycle life. They're as 
good or better than current lithium-ion.” He 
adds that Seeo packs could be half the weight of 
those in the current Nissan Leaf — the leading 
battery electric car worldwide in terms of sales. 
The company also says its cells achieve 350 watt- 
hours per kilogram in the lab, but the real-world 
performance could be less. 

One more player, Solid Power, which is based 
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up to 500 charging—dis- 
charging cycles of dura- 
bility. Company founder Doug Campbell says 
that Solid Power is targeting the automotive sec- 
tor, although its first market may be the armed 
forces, for use in communications equipment. 
“Troops in the field can carry 60 pounds of bat- 
teries, and if we can cut that in half, it’s a strong 
value proposition for the US military,’ he says. 

For competitive reasons, many of the lab- 
stage battery companies are keen to empha- 
size the positive aspects of their research, but 
not all are forthcoming with sample cells or 
technical details. Sakti3, based in Ann Arbor, 
Michigan, and headed by former University 
of Michigan engineer Ann Marie Sastry, is 
promising greatly improved energy density 
and an improved manufacturing process, with 
weight and cost savings, from its thin-film cells 
that operate at ambient temperatures. Sastry 
says that the company, with investments from 
General Motors and UK-based Dyson, has 
demonstrated that it can double the density of 
conventional lithium-ion at the lab scale. 

But Sakti3 has released little information 
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Lithium-ion batteries have led to fires in some aircraft. 


about its chemistry or its results, so its claims are 
difficult to verify. Laslau estimates that Sakti3 
has reached a fairly impressive 300 watt-hours 
per kilogram, but Sastry says that the company 
is not disclosing energy-density results just yet. 

Cost to manufacture is another important 
factor. “We do think that solid-state technology 
will enable much better performance, at lower 
cost, than the incumbents,” Sastry says. Her 
company is aiming for cells that cost just $100 
per kilowatt-hour, which means around $2,400 
for an average-sized electric-vehicle pack, plus 
costs for packaging, transportation and other 
factors. Nissan currently sells its lithium-ion 
packs for $5,499 in the United States, although 
it may be losing money on every sale. Asked 
if her cells could enable the goal of a 450-km, 
$25,000 electric car, Sastry says: “At least” 

Scaling up to that level, Sastry admits, will be 
challenging because of the need to invent man- 
ufacturing processes. But “everything we have 
done to date can in principle be done at scale”. 

The 30-kilowatt-hour solid-state lithium 
metal polymer battery packs in the Bluecars are 
here now, but the 240-km range is not a huge 
improvement on current technology. BlueIndy’s 
packs require heating above ambient tempera- 
ture, and that uses some of the battery’s energy 
and reduces range. 

Solid-state-battery researchers will surely 
hit some as-yet unforeseen roadblocks on the 
way to a commercially durable cell. For cars, 
conventional lithium-ion probably has at least 
a decade of dominance remaining, although 
Sadoway believes that “the rush to deployment 
would be very, very fast” if stable, high-density 
solid-state cells were developed. 

Solid-state batteries, Sadoway adds, “would 
be preferable, because they’re a lot safer”. The 
lithium-ion fires that have occurred on some 
aircraft show what can happen when you rush 
to scale up battery technology without doing 
your homework first. m 


Jim Motavalli is freelance writer based in 
Fairfield, Connecticut. 
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A researcher at the Pacific Northwest National Laboratory in Washington working on flow batteries. 


ELECTROCHEMISTRY 


Liquid assets 


Flow batteries, which release electricity through fluid-based 
reactions, could revolutionize renewable-energy storage. 


BY NEIL SAVAGE 


hen state officials flipped a switch 
earlier this year at an engineering 
company in Pullman, Washington, 


they shone a light on one possible future for 
energy storage. That switch activated the latest 
type of flow battery, the largest in the Western 
Hemisphere. Rechargeable flow batteries, which 
store energy in tanks filled with liquids, have the 
potential to be cheaper than their conventional, 
solid cousins. They are also more adaptable to 
the needs of electrical grids, which are starting 
to rely on intermittent sources of energy such as 
wind and solar cells. 

The Pullman flow installation, made up of 
big white boxes that together take up roughly 
the same amount of space as two tractor-trail- 
ers, stores 4 megawatt-hours of energy. That is 
more than enough to run four average homes for 
a month. Although the installation holds only a 


fraction of the power the grid will require, it is 
introducing a new generation of energy-storage 
technology. 

If wind and solar power are ever to provide 
a significant portion of the world’s electricity, 
new ways will be needed to store that energy. 
Existing batteries such as lithium ion and lead 
acid do not provide the necessary combina- 
tion of long-term energy storage and rapid 
delivery of energy — just think how quickly 
a lead-acid car battery can be drained by a 
driver trying repeatedly to start a car ona cold 
day, or the overnight charge that an electric 
vehicle needs. 

Flow batteries could provide an alterna- 
tive. They can store energy for a long time, 
but provide it quickly when needed; they are 
liquid-based, so inherently safer than con- 
ventional batteries; and because the energy- 
storing liquids are kept in external tanks, 
changing their storage capacity is relatively 
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simple. Most importantly, if researchers can 
develop the right combination of chemistries, 
flow batteries could be much less expensive 
over their lifetime than existing batteries. 


GO WITH THE FLOW 

A flow battery is a type of fuel cell that consists 
of two tanks, each containing an electrolyte 
made of some sort of energy-storing material 
—a metal or a polymer — dissolved in a liquid. 
One liquid is the negative side of the battery, the 
other the positive side. The liquids are pumped 
through a stack that contains positive and nega- 
tive electrodes and a membrane that keeps the 
liquids from mixing, but allows ions to cross. 
The liquids undergo a reduction and oxidation, 
or redox, reaction, transferring electrons from 
the negative to the positive tank (see ‘Renewable 
energy storage’). 

The most common version uses the transi- 
tion metal vanadium (V) dissolved in acid. Dur- 
ing discharge, ions of V** on the negative side 
of the battery are oxidized to V**. The electrons 
given up by the vanadium ions flow out to an 
external circuit, providing a current to whatever 
the battery is powering. The electrons travel to 
the positive side, where they latch onto a differ- 
ent ion, V™, reducing it to V**. During charg- 
ing, this redox reaction is reversed, removing 
electrons from the positive side to recharge the 
negative side. 

This design offers some inherent advantages. 
For one, unlike in a conventional battery, the 
power capacity and the energy capacity are 
separate. Energy is determined by the volume 
of the electrolyte, whereas power is controlled 
by the area of the electrode stack. “If you need to 
increase your capacity, you can easily just buy a 
new tank and fill it up with the chemicals,’ says 
Adam Weber, a chemical engineer at the Law- 
rence Berkeley National Laboratory in Califor- 
nia. For higher powers, you can simply run the 
electrolyte through more or larger stacks. 

Storing the electrolytes separately is also 
safer. In conventional batteries, Weber says, 
“you're trying to store a lot of power and energy 
in a little box”. This design leads to a risk of 
catastrophic discharge of stored energy that 
is absent in flow batteries. Because many flow 
batteries are water-based, they are not likely to 
catch fire, unlike the lithium-ion batteries that 
have destroyed mobile phones and grounded 
aeroplanes. 

Another problem with conventional batteries 
is that the diodes swell and shrink as ions pass 
back and forth through them, eventually lead- 
ing the materials to fracture and fail. Liquids do 
not crack, so the electrolytes in flow batteries 
can last indefinitely. 


CLEANER TRANSPORT 

Although most researchers are developing flow 
batteries with an eye towards grid storage, or to 
accompany home-based solar energy, there are 
also efforts to build flow batteries that work in 
electric vehicles. Because a flow battery can be 
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recharged simply by exchanging the liquid in it, 
owners of electric vehicles will not have to worry 
about stopping to charge their cars overnight. 
“Range doesn't become so important, as long as 
there's a filling station somewhere,’ says Carlo 
Segre, a physicist at the Illinois Institute of 
Technology in Chicago. He has a grant from the 
Advanced Research Projects Agency—Energy 
(ARPA-E) to develop a flow battery for cars 
that he says could have a range of 800 kilome- 
tres or more (Tesla Motors says that the batter- 
ies used in its Roadster vehicle have a range of 
about 640 km). Segre’s battery uses nanoparti- 
cles of nickel and nickel hydroxide suspended 
ina potassium-based electrolyte. 

The challenge in making flow batteries viable 
lies in finding designs and chemistries that pro- 
vide good long-term storage, desirable current 
and voltage characteristics, a long lifespan and 
a competitive price. The Pullman battery, and 
a few higher-power batteries — two installed in 
China, three in Japan — are all based on vana- 
dium chemistry, which provides both high 
energy and power. 

Usually, the vanadium ions are dissolved in 
sulfuric acid. But there is a limit to how many 
vanadium ions the acid can hold, capping 
how much energy a given volume of electro- 
lyte can contain. On top of that, the chemistry 
only works between 10°C and 40°C. Outside 
that range, the vanadium precipitates out of 
the acid, as either a powder or a gel. “Once 
it precipitated out it would jam your pump- 
ing system. Your battery would be dead,” says 
Wei Wang, a materials scientist at the Pacific 
Northwest National Laboratory in Richland, 
Washington. Wang and his team developed a 
solution of sulfuric and hydrochloric acids that 
can handle temperatures from —5°C to 60°C, 
and holds about 70% more vanadium than cur- 
rent systems’. The Pullman battery uses this 
new electrolyte. 

Even with this expanded range of operat- 
ing temperatures, vanadium is not the ideal 
flow-battery material. It is rare and expensive 
—a 2011 estimate’ from the energy-company- 
funded Electric Power Research Institute puts 
the cost of a vanadium redox flow battery at 
US$3,000-3,310 per kilowatt. At that price, it 
would take about $4-billion worth of vanadium 
batteries to provide California with the 1.3 giga- 
watts of storage it wants by 2020. 

To find a cheaper alternative, Wang is work- 
ing on zinc bromine batteries. Zinc is much 
cheaper than vanadium, and can carry more 
electrical charge. But Wang uses it as a solid, so 
one half of the hybrid flow battery does not flow 
and the volume of the zinc electrolyte cannot be 
changed. Still, he says, the hybrid flow battery 
allows more flexibility in system design than 
conventional devices, and its cost and perfor- 
mance may turn out to be appealing. 

Instead of replacing a liquid with a solid, 
Weber made one side of his flow battery a 
gas. During discharge in his hydrogen bro- 
mine battery, gaseous hydrogen is oxidized at 


RENEWABLE ENERGY STORAGE 


BATTERIES | OUTLOOK 


Tanks of metals or polymers in liquid make up the positive and negative sides of a flow battery. The liquids are 
pumped through electrodes, where they undergo reactions to generate electricity when a load is connected. 


Electrode 


Membrane 


Electrode 


the negative electrode, producing an ion that 
passes through the membrane and reacts with 
the bromine. The battery has reached some of 
the highest powers of any redox flow chemistry 
in the lab’. And because hydrogen is inexpen- 
sive, the system could bea winner economically. 


COST REDUCTION 
Lithium, seen as the main conventional com- 
petition for flow batteries, may have its place 
in these upstarts as well. Yet-Ming Chiang, a 
materials scientist at the Massachusetts Insti- 
tute of Technology in Cambridge, is developing 
alithium sulfur flow battery*. Researchers have 
been working on conventional lithium sulfur 
batteries, but the elements tend to react, creat- 
ing polysulfides that migrate to the electrode 
and block the flow of current. What is bad for 
conventional batteries might be good for flow 
batteries. Chiang’s set-up uses sulfur as a posi- 
tive electrode. During discharge, lithium ions 
from the negative electrolyte move to and react 
with the sulfur cathode, yielding polysulfides 
that are suspended in the electrolyte liquid. 
Charging the battery causes the polysulfides to 
dissolve, and the sulfur precipitates out, recreat- 
ing the positive electrode. And because sulfur 
is readily available, the battery should be inex- 
pensive. “The attraction of sulfur is that it really 
is essentially free as a material? Chiang says. 
His other innovation is to replace the flow 
battery's current collector, which is typically a 
stationary mesh of carbon fibres that the elec- 
trolytes must pass through. The mesh can clog, 
so instead, Chiang created a gel of nanoscale 
carbon particles, which moves along with the 
electrolyte and acts as a sort of ‘liquid wire’ to 
collect the current. That provides a greater sur- 
face area to absorb the charges from the chemi- 
cal reaction, while shortening the distance the 
molecules have to travel to give up their charge. 
Another potentially affordable option could 
be organic materials. “We're not going to run 
out of carbon, hydrogen and oxygen,” says 
Michael Aziz, a materials scientist at Harvard 
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University in Cambridge, Massachusetts. 
Aziz was inspired by plant photosynthesis to 
test quinones, a family of organic molecules 
common in plants. Two years ago, he dem- 
onstrated’ a metal-free flow battery using 
bromine on one side and a quinone molecule 
called AQDS — which occurs naturally in 
rhubarb — on the other. He has gone on to 
synthesize other versions of quinone, looking 
for the one that is least expensive and most 
workable. “Getting the overall cost down is a 
big deal, and doing it in an environmentally 
friendly way is even bigger. We think we have 
a fighting chance of doing both,” says Aziz. 
But the organic route is not without its pitfalls. 
Organic molecules tend to undergo many dif- 
ferent chemical reactions, and the wrong ones 
could produce insoluble particles that gum up 
the battery. 

In the near term, the batteries most widely 
used to store energy on the grid will be con- 
ventional lithium-ion batteries, Chiang says. 
But several types of flow battery offer combi- 
nations of efficiency, safety and cost that could 
allow them to displace conventional batteries. 
Meanwhile, if Segre and others develop work- 
able flow batteries for electric vehicles, the fact 
that they can be refilled like a car’s petrol tank 
would make them an attractive alternative to 
today’s battery technology. Although research- 
ers are still working to get just the right designs 
and chemistries, Aziz believes that flow batteries 
could be ready for a great leap forward. “In two 
years we'll have something that’s at roughly the 
industrial use level, he predicts. “I think it’s the 
next big thing.” m 


Neil Savage is a freelance science and 
technology writer in Lowell, Massachusetts. 
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Processing of lithium-ion batteries for recycling at Retriev Technologies in Anaheim, California. 


RECYCLING 


Lazarus batteries 


Battery recycling can be hard, energy intensive and 
uneconomic. But soon, dead power cells could be more 


easily resurrected. 


BY ERICA GIES 


ing an increasingly common sight in many 

countries, and city dwellers are adjusting 
to the eerie hush of electric cars — nowhere 
more so than in Norway, where one-quarter of 
new Vehicles sold in the first three months of 
2015 were electric or hybrids. But these signs of 
progress towards a more-sustainable future rely 
on big batteries that are packed with chemicals 
and mined metals, and take large amounts of 
energy to manufacture. 

Recycling can soften the environmental 
impact of batteries by reducing the energy 
required for their production, as well as the 
environmental harm caused by the disposal of 
hazardous battery materials and the mining of 
new ones. Sometimes, it can also provide an 
economic incentive, because valuable recov- 
ered materials can be sold on. 

Unfortunately, most types of batteries are 
not being recycled, primarily because bat- 
tery recycling is not required by law in most 
countries. Some US states have policies for 
certain battery types that encourage collection 


S™ panels and wind turbines are becom- 


and recycling, but they do not make battery 
recycling compulsory. A European Union law 
mandates recycling, but it is being phased in 
over time. That leaves economics as a motiva- 
tion, but in general, batteries do not contain 
enough precious metals to make recycling 
economically viable. In addition, the recycling 
processes available today reduce extracted 
metals back to their original form, meaning 
that multiple processes must be repeated to 
build a new battery from them. 

But a promising technology could provide 
a much simpler and less destructive way to 
recycle batteries in the future. And better plan- 
ning at the design stage could help. 


LEAD TAKES THE LEAD 

At present, one form of battery is already 
widely recycled. Lead-acid batteries (which 
are ubiquitous in petrol-fuelled cars) have 
about a 98% recycling rate in most developed 
countries, primarily because lead is toxic and 
its disposal is heavily regulated. Furthermore, 
most car batteries are replaced by professional 
mechanics, who are plugged into the battery- 
recycling network. 
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Recycling lead-acid batteries also makes 
economic sense: it costs less than mining new 
lead, and because this heavy metal makes up 
about 65% of the battery’s mass, it is easy to 
recover. The process saves money for the 
manufacturers and is lucrative for scrap deal- 
ers, says Joe Acker, who has worked in lead- 
acid battery recycling for 25 years and is now 
president of Retriev Technologies, formerly 
Toxco, a battery-recycling company in Ana- 
heim, California. 


LITHIUM-ION LAGS BEHIND 

But whereas lead-acid batteries are almost 
always recycled, it is much less common for 
the lithium-ion batteries in mobile phones, 
portable electronics and electric cars to be 
recovered. In the EU, just 5% of the lithium- 
ion batteries sold in 2010 were recycled. 

One reason for this low rate is that recy- 
cling lithium-ion batteries can be compli- 
cated. Lead-acid batteries have few materials 
and designs, says Linda Gaines, an analyst 
for the Center for Transportation Research at 
Argonne National Laboratory in Illinois, and 
breaking them apart to recover constituent 
parts is easy because the product is standard- 
ized across the market. Conversely, lithium- 
ion batteries have various chemistries and 
shapes. The cathode could include cobalt, 
nickel, manganese, phosphorus and iron along 
with lithium, whereas the anode is typically 
carbon, and the organic electrolyte contains 
a lithium salt. Copper, aluminium and steel 
are also used as the substrate for the electrode 
materials and for the housing. 

Much of the lithium-ion battery recycling 
so far has been motivated by the extraction 
of cobalt from the cathode, along with, to a 
lesser extent, nickel and copper. The value of 
these elements makes recycling economically 
attractive. In particular, cobalt is expensive and 
difficult to obtain; one of its primary sources is 
the conflict zone in the Democratic Republic 
of the Congo. 

However, cobalt is gradually being phased 
out as researchers find cheaper cathode mate- 
rials. So newer batteries, including “most of 
those now being built for vehicles, are less 
attractive for recycling by current methods’, 
says Gaines. 

Methods of recycling in commercial use 
today are based on metal refining. First and 
most widespread is pyrometallurgy, in which 
batteries are fed into a smelter and melted 
down to recover metals. In older lithium-ion 
batteries, cobalt might make up 18% of the 
battery. But the rest of the materials, including 
lithium and aluminium, end up in slag and are 
then buried or embedded in concrete. 

The Brussels-based battery-recycling 
company Umicore uses pyrometallurgy to 
recover cobalt, nickel, copper and other met- 
als from lithium-ion batteries and nickel-metal 
hydride batteries. However, profits from selling 
recovered metals are not a major driver of its 


RETRIEV TECHNOLOGIES 


LEFT: UMICORE; RIGHT: D. BOSWELL/J. ALBERTS 


recycling operation, says Maarten Quix, the 
firm's head of battery recycling. Rather, bat- 
tery manufacturers pay the company to recy- 
cle their products. “Producers and collectors 
pay for the service in order to ensure that the 
credentials of ‘green’ products are solid, or to 
meet legislation targets,” he says. 

A second method, hydrometallurgy, is used 
by recyclers such as Retriev Technologies. In 
this process, cells are chopped up in water and 
separated into various product streams on the 
basis of what sinks, floats or hangs in solution. 
Chemicals are then added to the water, where 
they react with cathode components to make 
new compounds that are easier to separate. 


DIRECT RECOVERY 

But both pyrometallurgy and hydrometallurgy 
have limitations. Aside from their high energy 
and emissions footprint, many materials, 
including lithium and aluminium, end up in 
the waste stream after smelting, which means 
that more materials must be mined to build 
new batteries. “The model as it is now drains 
material away from batteries,” says Steve Sloop, 
president of OnTo Technology, a research and 
development company in Bend, Oregon, that 
is working on recycling batteries. 

Sloop developed a method called direct 
recovery. Rather than recovering basic ele- 
ments, as with pyrometallurgy, or partially 
breaking down the molecular structure of com- 
pounds, as with hydrometallurgy, this method 
involves bathing the cathode in a soft chemi- 
cal solution to rejuvenate it. The reactions are 
topotactic, meaning that 


structural elements ofthe “A luminium, 
cathode are retained, but plas tics, 

its chemical composition carbon and 

is changed. In this way, metal parts of 
cathode materialssuchas the cellcanall 


lithium that are not valu- bereused.” 
able enough to be recov- 

ered by pyrometallurgy or hydrometallurgy can 
be reused in batteries. It is a low-temperature, 
low-energy, low-emissions process with very 
limited waste, he says. 

“Tt’s really cheap to do this,’ says Sloop. Most 
of the cost savings come from not having to 
rebuild the cathode again, and direct recov- 
ery uses less energy than pyro- and hydro- 
metallurgical processes. It also produces 
fewer emissions and waste, saving money on 
compliance with air-pollution regulations and 
disposal fees. 

The process is a closed-loop system, says 
Sloop: “Aluminium, plastics, carbon, metal 
parts of the cell can all be reused.” He calculates 
that because of this thriftiness, direct recovery 
requires less than 2% of the energy required 
for new metal refining. Nor does it depend 
on large volumes; recyclers can add modular 
plants as the volume of material ramps up. 

Gaines says that she sees great promise in 
direct recovery, which is also being developed 
by Retriev. “I don’t see how any of the other 
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Pyrometallurgical processes convert used batteries into metal alloy (left); Steve Sloop has developed a 
method that produces fewer emissions and waste (right). 


processes that have been developed so far can 
produce a valuable product from the cathodes 
that don’t have valuable elements in them, says 
Gaines. With direct recovery, she says, “the 
value lies in retaining the materials in their 
battery-ready state”. 

Although direct recovery is not yet being 
used commercially, early tests are promising, 
says Sloop. “The performance levels are as 
high as with new materials.” OnTo Technol- 
ogy is testing a car-grade battery built with 
rejuvenated material from a worn-out electric- 
vehicle battery. “It’s still cycling, 2,000 cycles 
later,” he says. 


POLICY PUSH 

Until direct recovery has proved itself, top- 
down regulations can help to kick-start 
recycling programmes and drive innovation 
— as happened with lead-acid batteries. In 
the EU, a manufacturer is responsible for its 
product through its entire life cycle — espe- 
cially collection, recycling and final disposal. 
The 2006 EU Battery Directive set a timeline 
for recycling progressively greater percentages 
of batteries and made the battery producers 
responsible for those efforts. 

The programme is still in its early days — the 
directive’s first target was 25% battery recycling 
by September 2012 and the next target is 45% 
by September 2016. The data so far are murky, 
says Gaines, so it is not yet clear how successful 
the directive has been in increasing recycling. 

In the United States, there is no federal 
regulation for battery recycling, leaving a 
patchwork of state-level rules. Industry has 
made some effort to fill that gap. Since 1996, 
a voluntary programme called Call2Recycle, 
which is funded by electronics manufacturers, 
has recycled more than 38.5-million kilograms 
of small consumer batteries and mobile phones 
in the United States and Canada, but that is 
still just a tiny fraction of the total. Carl Smith, 
Call2Recycle'’s president, says that the organi- 
zation recycles perhaps 2% of alkaline batteries 


and possibly 12% of lithium-ion ones. British 
Columbia, Quebec and Manitoba in Canada 
have mandatory recycling programme, and in 
these places, Call2Recycle collects about 25% 
of what is sold, says Smith. 


PLANNING AHEAD 

Prescient product design could help to increase 
recycling. Gaines has some ideas for strategies 
to facilitate recovery and reuse, such as using 
the minimum number of components, stand- 
ardizing formats and materials, and avoiding 
toxic materials (such as cadmium, arsenic, 
mercury or halogens). She also recommends 
designs that more easily allow separation, such 
as nuts and bolts instead of welds. 

Today’s batteries are difficult and labour- 
intensive to disassemble because not much 
thought goes into their end of life, says Acker. 
More than 130 energy-storage companies 
operate in California, but end-of-life concerns 
are widely ignored by innovators and the ven- 
ture capitalists who fund them. Instead, they 
prioritize performance and cost, the better to 
grab market share. San José State University 
in California offers a battery-focused mas- 
ter’s degree in electrical engineering that has 
courses on various kinds of electrochemistry, 
manufacturing and technology, but “end-of- 
life considerations do not enter into it’, says 
Shahab Ardalan, the electrical engineer who 
oversees the master’s programme. 

That lack of foresight is unfortunate, says 
Gaines. “If you dont think about recycling 
early, you could develop some battery chem- 
istry that’s going to be completely intractable.” 

Nevertheless, she is hopeful about the future 
of battery recycling, particularly because direct 
recovery holds the promise of making it much 
more economic. Even if governments fail to 
wield policy sticks, economic carrots could 
finally deliver a greener energy future. m 


Erica Gies is a freelance science writer in 
Victoria, British Columbia. 
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ENERGY STORAGE 


Power revolution 


Electrical grids increasingly depend on intermittent renewable sources. To smooth the supply 
out, utilities companies are testing alternatives to storing energy in conventional batteries. 


BY PETER FAIRLEY 


mer’s day in California. Millions of solar 

panels are soaking up the Sun’s rays to 
power the air-conditioning systems that keep 
homes and offices throughout the state cool. 
The devices are working efficiently thanks 
to an intelligent conversation taking place 
between the appliances and the electrical 
grid. As clouds drift across the Sun, casting 
shadows, the air conditioners deftly increase 
or decrease their output in sync with the var- 
ying flow of solar energy. In areas where the 
demand for electricity looks as though it will 
overload the power-transmission lines, home 
air-conditioning units take it in turns to go 
offline for an hour. In other areas, where 
solar power threatens to exceed demand, 
hot-water heaters are turned on to absorb 
the extra energy. 

This imagined future power grid dem- 
onstrates the same degree of flexibility 
that energy-storage advocates predict will 
occur with the widespread implementation 
of batteries, but there is no electrochem- 
istry involved — software manipulates 
energy-consuming equipment so that most 


lE is 2025 and another sweltering sum- 


electricity is used when it is most abundant, 
cheap or green. 

The concept is called ‘demand dispatch, 
because it would activate and deactivate 
power demand — much as grid operators 
dynamically dispatch electricity generated 
by power plants today. In the future, power 
grids will probably use both the ‘virtual stor- 
age’ created by demand dispatch and the true 
energy storage from batteries. But demand 
dispatch could be the bigger player of the 
two, with smart use of existing appliances 
offering a smaller environmental footprint 
and slimmer price tag than batteries. 

“You can create flexibility much more 
cheaply by controlling electrical loads,” says 
Sean Meyn, director of the Florida Institute 
for Sustainable Energy in Gainesville. “You 
do have to install communications equip- 
ment, but after that there’s nothing to wear 
out, nothing to replace” 

Meyn is not alone in anticipating that 
demand dispatch — rather than batteries — 
will be the first line of defence for future grids 
(see ‘Storage solutions’). A report published in 
May by the Massachusetts Institute of Technol- 
ogy Energy Initiative (go.nature.com/ogv7wa) 
said that controlling demand to maximize 
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consumption of solar power as it is generated is 
cheaper than storing excess power in batteries. 
“Load shifting to hours when [solar] genera- 
tion is high should be the first resource to look 
at,” notes the report. 

This non-battery option is already com- 
ing online as researchers and commercial 
developers establish communication pro- 
tocols to link energy-consuming equipment 
to grid operators, develop algorithms for 
optimizing their deployment and experi- 
ment with the business models required to 
finance the systems. But demand dispatch 
also faces concerns about its security and 
its financial benefits. A major test in the US 
Supreme Court this October could unleash 
the potential of demand dispatch, or leave it 
languishing on the sidelines to be eclipsed by 
battery storage. 


REACTING TO DEMAND 

Utilities companies around the world must 
keep spare power plants sitting idle to serve 
energy-demand spikes that may occur on just 
a few days each year. To keep the number of 
extra plants to a minimum, they use ‘demand- 
side management to help to control the need 
for power. This includes giving rebates for the 


SEBASTIEN THIBAULT 


use of efficient appliances or implementing 
time-of-day pricing that incentivizes energy 
use when grid demand is low. Many utilities 
firms also practise ‘demand response; in which 
they pay selected power users — usually facto- 
ries and other large industrial and commercial 
operations — to cut their power consumption 
when the power grids are really strained. 

Demand dispatch to provide virtual energy 
storage is an advanced form of demand 
response, the growth potential of which is lim- 
ited by its disruptive impact on power users — 
shutting down a factory to save energy means 
lost revenue for the owners. Demand dis- 
patch is about automating demand response 
and turning it into a more flexible, dynamic 
asset that customers hardly notice. The key is 
tapping into equipment that can be shut off 
for a few minutes or hours without incon- 
veniencing users. Not all loads are a good fit, 
admits Mary Ann Piette, head of the Build- 
ing Technology and Urban Systems Division 
at Lawrence Berkeley National Laboratory in 
California. Lighting is hard to switch off unob- 
trusively, for example. 

But appropriate equipment is more com- 
mon than you might think, she says. Electric 
water heaters, refrigerators and air-condition- 
ing units can all be shut off for several hours at 
atime without affecting customers, thanks to 
thermal inertia, creating a window for virtual 
storage with little or no inconvenience. 

Conventional demand response is exclu- 
sively about reducing load, but demand dis- 
patch can often absorb electricity as well. Just 
as a battery can be charged and discharged, 
some electrical devices can be directed to pre- 
heat water, precool a space or bump up the 
pressure in a water line when strong winds and 
bright sunshine are providing surplus power. 
But aggregating many devices to create a sub- 
stantial and reliable virtual-storage resource 
makes demand dispatch a more complex tool 
than batteries. “It’s trickier,’ admits Robert 
Pratt, who leads the smart-grid programme at 
the Pacific Northwest National Laboratory in 
Richland, Washington. “But it may be cheaper” 

Piette says that the system her group 
designed for commercial buildings costs 
US$200-300 per kilowatt, considerably less 
than battery systems that can cost more than 
$1,000 per kilowatt. 

Meypn has studied the potential for control- 
ling air-conditioning systems in commercial 
buildings to absorb the short-term minute-to- 
minute variability in solar power. He says that 
many commercial air-conditioning systems 
could keep supply and demand in balance by 
adjusting their fan speeds depending on the 
availability of solar power. His studies at a 
building on the University of Florida campus 
suggest that this would have no detectable effect 
on comfort, and would add no cost to running 
the equipment. The grid impact, meanwhile, 
looks large. “We can accommodate a grid pow- 
ered 50% by renewable energy without the use 


of batteries,” says Meyn. 

Entrepreneurial firms are already beginning 
to make money by doing what Meyn proposes. 
ENBALA Power Networks in Vancouver, 
Canada, which has been selling grid-balancing 
services since 2011, taps into large power loads 
that can be controlled variably, such as indus- 
trial water pumps, and adjusts them in response 
to signals from grid operators to use the avail- 
able energy more efficiently. 

Several demonstration projects completed 
in the past three years have tested demand 
dispatch as a means of smoothing out larger 
energy swings over several hours, such as the 
large shifts in wind-power output caused by 
changing weather systems. PowerShift Atlan- 
tic, a demonstration completed this year by 
a consortium of power-grid operators in 
three of Canada’s maritime provinces, cho- 
reographed 17 megawatts of equipment at 
1,400 homes and businesses to absorb bursts 
and troughs in the wind power that, at cer- 
tain times, generates more than half of the 
region’s electricity. 

Controlling 20-30% of power demand 
would be enough to 
almost flatten out 


Toren Oe “We can 
the variability in accommodate 
the region’s power a grid powered 
supply. says Tom 59% by renewable 
sterhus, chief ithout 
executive of Integral ete) ee 
the use of 


Analytics, the com- 
pany in Cincinnati, 
Ohio, that made 
the control software for PowerShift Atlan- 
tic. Michel Losier, the project’s manager and 
director of customer and community engage- 
ment for utility firm NB Power in New Brun- 
swick, Canada, says that on a typical winter's 
day, when the outside air temperature is often 
below freezing, water tanks and electric heat- 
ing account for about 55% of New Brunswick's 
peak load — exactly the kinds of appliances 
that could be controlled to shape demand. 


batteries.” 


REMOTE CONTROL 

Researchers and entrepreneurs face several 
technical hurdles to turn projects like Power- 
Shift Atlantic into the reality for power-grid 
operation. Tapping the virtual-storage potential 
of millions of devices is a massive communica- 
tions and control challenge. And, because many 
of those devices reside in homes, it means deal- 
ing with residential power customers. 

Forging a reliable communications link 
between residential power devices and 
grid operators emerged as a challenge in 
both PowerShift Atlantic and a series of recent 
smart-grid projects funded by the US Depart- 
ment of Energy. Utilities often struggled to get 
various systems to communicate with each 
other. Other projects found that communica- 
tion using smart meters, many of which are 
optimized to exchange data with the utility 
every 15 minutes, was too slow for centralized 
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dispatch of virtual storage. 

As a result, operators are trying to bypass 
the smart meters and link to customers’ home 
appliances using their broadband Internet 
connections. A standard protocol for Internet 
communications called OpenADR, which was 
developed by Piette’s group, makes it easy for all 
the parts of the system to talk to each other. As 
the broad interconnection of consumer devices 
known as the Internet of Things progresses, 
costs will drop further and more loads will be 
able to participate, says Piette. “The future is 
rosy for virtual storage. Thanks to the Inter- 
net of Things, the cost of telemetry is getting 
cheaper” 

But using the Internet comes with a caveat: it 
potentially leaves consumers and power grids 
vulnerable to cyberattacks and breaches in con- 
fidentiality. Pratt says that these risks will grow 
as connected appliances link to an ever-wider 
array of sites such as social media and online- 
shopping outfits. “If your refrigerator is not only 
connected to the grid but also running through 
the Wild Wild West of the Internet, that may 
not be good, he says. “It’s an emerging concern 
that in the Internet of Things your devices will 
be so promiscuous in who they talk to that we're 
really opening up Pandora’ box.” 

Pratt, Meyn and others are working on 
control schemes that could reduce the 
cybersecurity threat by relying on more dis- 
tributed intelligence, with decisions made by 
the devices themselves rather than a central 
grid computer. Meyn is developing a dis- 
tributed control scheme that he hopes will 
ultimately require no more than one digital 
handshake per day, rather than constant com- 
munication between devices and the grid. 

Then again, Internet connectivity also 
offers powerful opportunities, and Internet 
companies such as Google are keen to take 
advantage. Google acquired home-automation 
firm Nest Labs, based in Palo Alto, California, 
for $3.2 billion in 2014, and it is using it to 
build a virtual energy-storage system. 

Nest’s adaptive thermostats use sensors to 
guess when homes are unoccupied and then 
automatically put their heaters or air-condi- 
tioning units into a low-energy mode, making 
energy conservation easy. But they are also 
connected to the Internet, enabling the ther- 
mostat to factor local weather predictions into 
its behaviour. And, for homeowners who sign 
up to Nest’s demand-dispatch programmes, 
connectivity creates a pool of potential virtual 
storage that utilities can access, through Nest, 
to dynamically adjust energy demand. 

Nine utilities in the United States and Can- 
ada have signed up for Nest's demand-dispatch 
programmes, including Austin Energy in 
Texas, which had enrolled nearly 9,000 custom- 
ers with smart thermostats by June. Together 
those homes can absorb or release up to 10.7 
megawatts of power — a virtual storage capa- 
bility that the utility expects to use 12-15 times 
per year to control demand spikes on hot days, 
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Storage solutions 


eee 


Grid-scale batteries face competition 

not only from ‘virtual storage’ systems 

that manipulate power demand, but also 
from other energy-storage technologies. 
Well-established alternatives include 
pumped storage hydropower, flywheels and 
compressed air. 

Pumped-storage plants are the most 
affordable and proven means of large- 
scale energy storage, and they account for 
97.5% of energy-storage capacity installed 
on global power grids, according to the 
US Department of Energy. Most plants 
charge up by pushing water uphill using 
surplus energy generated overnight (when 
consumption is low); during peak load times, 
the water flows downhill and spins a turbine 
to generate power. Utilities are now eyeing 
pumped storage as a means of managing 
growing flows of renewable energy. 

For example, the Kauai Island Utility 
Cooperative (KIUC), which runs the power 
grid for the Hawaiian island of Kauai, 
plans to spend between US$55 million 
and $65 million to repurpose disused 
sugar-plantation reservoirs to create a 
25-megawatt pumped-storage plant to 
help it to manage solar power. The KIUC 
estimates that storing energy from solar 
panels for use at night will be roughly 35% 
cheaper than running its oil-fired power 
plants. It will take time. The KIUC began 
studies in earnest in 2014, and says that it 
will be 2019 or later before the plant is built. 


In pumped-storage hydropower stations, water is released to create energy when demand is high. 


Flywheels, mechanical devices that are 
used to generate a power reserve, compete 
with batteries in applications that frequently 
require large power bursts for short periods, 
such as buffering power lines serving 
industrial mining equipment — heavy 
cycling conditions that degrade the capacity 
of batteries. Several recent flywheel projects 
target smaller power systems coping with 
high levels of intermittent renewable energy, 
using the flywheels to absorb and release 
power within microseconds to stabilize 
power demand in real time. In June, ABB 
in Zurich, Switzerland, installed a flywheel 
system on Alaska’s Kodiak Island to help 
the local utility to cope with high levels of 
wind power. Darron Scott, president of the 
Kodiak Electric Association, says that the 
2-megawatt flywheel replaced a 3-year-old 
lead-acid battery system that was wearing 
out faster than expected. 

Another storage option is using 
electricity to compress air, which can then 
be converted back to electricity by using 
the air to spin a gas turbine. Early systems 
used underground caverns, including 
large installations in Alabama and in 
Germany. However, similar projects have 
been stymied by high equipment costs, 
energy losses and limited availability of 
appropriate geological formations. More- 
recent efforts focus on portable, distributed 
systems that store compressed air in steel 
or composite tanks. P.F. 
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when wholesale energy costs can rise from 
less than $50 per megawatt-hour to more than 
$1,000. 


POWER PITFALLS 

The biggest barrier to demand dispatch may 
be regulatory uncertainty over how it will be 
incorporated into the energy system, and how 
much grid operators should pay for it. There 
are positive signs for the market, as some states 
and regions forge ahead with rules that embrace 
demand dispatch, including PJM Interconnec- 
tion, the US grid operator for 13 mid-Atlantic 
states and the District of Columbia. 

Under US President Barack Obama, the Fed- 
eral Energy Regulatory Commission (FERC) 
has enthusiastically embraced demand-side 
management. In 2011, FERC’s Order 745 
authorized wholesale power markets to pay 
companies that aggregate demand response 
the same price for managing megawatt-hours 
of demand as power-plant operators receive for 
generating energy. New York’s Public Service 
Commission announced in June a mandate for 
all of the state's utilities to offer load-manage- 
ment programmes by mid-2016. 

Demand response is also growing rapidly 
internationally. European countries are open- 
ing their energy markets to demand-response 
resources, and Japan and China are running 
pilot projects. Navigant Research, which spe- 
cializes in clean-technology markets, pre- 
dicts that annual global spending on demand 
response will grow from $2 billion in 2015 to 
more than $12.8 billion by 2023 as it expands 
beyond North America. 

Despite these positive signs, there are still 
many energy markets in which regulators 
are taking a cautious approach, questioning 
whether the megawatt-hours of virtual stor- 
age that the technology promises will actually 
materialize when they are needed. This Octo- 
ber, the US Supreme Court will decide whether 
to uphold a challenge — that compensation is 
a matter for state regulators — to FERC Order 
745 by power generators. Advocates of demand 
dispatch, such as Meyn, say that the ruling is 
crucial for the technology’s future. 

Mei Shibata, co-founder of ThinkEco, a vir- 
tual-storage start-up in New York City, says that 
court challenges have already slowed growth in 
the United States by delaying the start of dem- 
onstration projects needed to convince energy 
regulators that the technology works. She pre- 
dicts a big shakeout coming as venture-funded 
companies developing demand that can be dis- 
patched run short of funds waiting for markets 
to mature. “The market will grow for residential 
and automated demand response, and conver- 
gence of demand response and the Internet of 
Things will happen, giving utilities better con- 
trol over energy use,’ she says. “But how that 
happens isn't going to bea very pretty picture. m 


Peter Fairley is a freelance science writer 
based in Victoria, British Columbia. 
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The energy density 

of batteries will need 

to be substantially 
increased and their cost 
decreased if renewable 


energy is to replace 
fossil fuels. Here are four 
important questions. 


BATTERIES 


4 BIG QUESTIONS 


BY KATHERINE BOURZAC 


How do we make 
batteries less 
expensive? 


The high price of batteries 
makes it more costly to integrate 
renewable energy — which can 


L be intermittent and so needs to 
~ be stored — into the grid. It also 


means that the battery pack 
makes up the lion’s share of the 
cost of electric vehicles. 


The true cost of a battery is 
not only the cell cost, but also 
the packaging, installation, 
lifetime and storage capacity. 


Tesla Motors’s 10-kilowatt- 


hour home storage system 
could cost about US$7,000, 
once all installation costs 
have been factored in. 


Companies can improve their 
packaging technology to lower 
costs without having to make 
changes to the fundamental 


~ chemistry of the battery. Jun Liu 


at Pacific Northwest National 
Laboratory in Washington, says 
that the cost needs to be reduced 
by half in the next few years. 


How can we make 
batteries that store 
more energy? 


How can we make 
batteries safer? 


Are batteries enough 
for a renewable grid? 


Increasing the amount 
of energy a battery can 
store in a given weight or 


: volume is the best way to 


bring down costs. 


If something goes wrong, such 
as overheating or a short circuit, 
batteries can catch fire. The 
Boeing 787 Dreamliner fleet was 


| grounded after batteries caught 


fire on planes in 2013. No one 
was injured, but the incidents 
caused people to imagine a 
worst-case scenario. 


A full transition to renewable 
energy will require more 
battery capacity than 


. currently exists in the world. 


The gold standard for 
battery energy-storage 
capacity is lithium ion, which 


. is around 250 watt-hours 


per kilogram. 


Batteries on the market today 
are generally safe. But as 
battery packs get bigger, are 
more widely adopted and 


“store more energy, risks will 


increase. 


World production of 
lithium-ion batteries is about 
35 gigawatt-hours per year. 


». That will increase, but even 
"with new electrode materials 


it is doubtful whether 
batteries can meet the 
world’s electricity needs. 


Many battery electrode 
materials under development 
can store up to ten times more 


"energy by weight than lithium- 


ion. On the anode side, these 
include silicon or pure lithium; 
on the cathode side alternatives 
include sulfur or air. 


Researchers are coming up with 
protective coatings to prevent the 
growth of lithium-metal dendrites 


: that cause short circuits, working 
* on packing battery cells so that a 


fire in one cell will not spread to 
the next, and designing systems 
to sense problems and turn the 
battery off before a fire starts. 


Non-electrochemical storage 
technologies are needed. 
Pumped hydropower can 


- store a lot of energy, but it 


only works where there is 
water and mountains; other 
geographies are better suited 
to geothermal storage. 


Katherine Bourzac is a science journalist based in San Francisco. 
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An ultrafast rechargeable aluminium-ion battery 


Meng-Chang Lin'?*, Ming Gong'*, Bingan Lu'**, Yingpeng Wu'*, Di-Yan Wang’*°, Mingyun Guan’, Michael Angell!, 


Changxin Chen!, Jiang Yang’, Bing-Joe Hwang® & Hongjie Dai! 


The development of new rechargeable battery systems could fuel var- 
ious energy applications, from personal electronics to grid storage’. 
Rechargeable aluminium-based batteries offer the possibilities of 
low cost and low flammability, together with three-electron-redox 
properties leading to high capacity’. However, research efforts over 
the past 30 years have encountered numerous problems, such as 
cathode material disintegration’, low cell discharge voltage (about 
0.55 volts; ref. 5), capacitive behaviour without discharge voltage 
plateaus (1.1-0.2 volts® or 1.8-0.8 volts’) and insufficient cycle life 
(less than 100 cycles) with rapid capacity decay (by 26-85 per cent 
over 100 cycles)*’. Here we present a rechargeable aluminium bat- 
tery with high-rate capability that uses an aluminium metal anode 
and a three-dimensional graphitic-foam cathode. The battery oper- 
ates through the electrochemical deposition and dissolution of alu- 
minium at the anode, and _ intercalation/de-intercalation of 
chloroaluminate anions in the graphite, using a non-flammable 
ionic liquid electrolyte. The cell exhibits well-defined discharge 
voltage plateaus near 2volts, a specific capacity of about 
70mA hg" and a Coulombic efficiency of approximately 98 per 
cent. The cathode was found to enable fast anion diffusion and 
intercalation, affording charging times of around one minute with 
a current density of ~4,000 mA g“' (equivalent to ~3,000 W kg’), 
and to withstand more than 7,500 cycles without capacity decay. 
Owing to the low-cost, low-flammability and three-electron redox 
properties of aluminium (Al), rechargeable Al-based batteries could in 
principle offer cost-effectiveness, high capacity and safety, which would 
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Figure 1 | Rechargeable Al/graphite cell. a, Schematic drawing of the Al/ 
graphite cell during discharge, using the optimal composition of the AICI;/ 
[EMIm]Cl ionic liquid electrolyte. On the anode side, metallic Al and AlCl. 
were transformed into Al,Cl," during discharging, and the reverse reaction 
took place during charging. On the cathode side, predominantly AICL, was 


lead to a substantial advance in energy storage technology**. However, 
research into rechargeable Al batteries over the past 30 years has failed 
to compete with research in other battery systems. This has been due 
to problems such as cathode material disintegration’, low cell discharge 
voltage (~0.55 V; ref. 5), capacitive behaviour without discharge voltage 
plateaus (1.1-0.2 V, or 1.8-0.8 V; refs 6 and 7, respectively), and insuf- 
ficient cycle life (<100 cycles) with rapid capacity decay (by 26-85% 
over 100 cycles)*’”. Here we report novel graphitic cathode materials 
that afford unprecedented discharge voltage profiles, cycling stabilities 
and rate capabilities for Al batteries. 

We constructed Al/graphite cells (see diagram in Fig. 1a) in Swagelok 
or pouch cells, using an aluminium foil (thickness ~ 15-250 um) anode, 
a graphitic cathode, and an ionic liquid electrolyte made from vacuum 
dried AICl;/1-ethyl-3-methylimidazolium chloride ([EMIm]Cl; see 
Methods, residual water ~500 p.p.m.). The cathode was made from either 
pyrolytic graphite (PG) foil (~17 pm) or a three-dimensional graphitic 
foam?"°. Both the PG foil and the graphitic-foam materials exhibited 
typical graphite structure, with a sharp (002) X-ray diffraction (XRD) 
graphite peak at 20 ~ 26.55° (d spacing, 3.35 A; Extended Data Fig. 1). 
The cell was first optimized in a Swagelok cell operating at 25 °C with 
a PG foil cathode. The optimal ratio of AlCl;/[EMIm]Cl was found to 
be ~1.3-1.5 (Extended Data Fig. 2a), affording a specific discharging 
capacity of 60-66 mA hg | (based on graphitic cathode mass) with a 
Coulombic efficiency of 95-98%. Raman spectroscopy revealed that 
with an AlCl3/[EMIm]Cl ratio of ~1.3, both AlCl, and Al,Cl, anions 
were present (Extended Data Fig. 2b) ata ratio [AlCl, ]/[AlCl, ] ~ 2.33 
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intercalated and de-intercalated between graphite layers during charge and 
discharge reactions, respectively. b, Galvanostatic charge and discharge curves 
of an Al/pyrolytic graphite (PG) Swagelok cell at a current density of 

66mA g ". Inset, charge and discharge cycles. c, Long-term stability test of an 
AI/PG cell at 66mAg '. 
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(ref. 11). The cathode specific discharging capacity was found to be 
independent of graphite mass (Extended Data Fig. 3), suggesting that 
the entirety of the graphite foil participated in the cathode reaction. 
The Al/PG cell exhibited clear discharge voltage plateaus in the ranges 
2.25-2.0 V and 1.9-1.5 V (Fig. 1b). The relatively high discharge voltage 
plateaus are unprecedented among all past Al-ion charge-storage sys- 
tems*”. Similar cell operation was observed with the amount of elec- 
trolyte lowered to ~0.02 ml per mg of cathode material (Extended Data 
Fig. 4). Charge—discharge cycling at a current density of66 mA g ‘(1C 
charging rate) demonstrated the high stability of the Al/PG cell, which 
nearly perfectly maintained its specific capacity over >200 cycles with 
a 98.1 + 0.4% Coulombic efficiency (Fig. 1c). This was consistent with 
the high reversibility of Al dissolution/deposition, with Coulombic 
efficiencies of 98.6-99.8% in ionic liquid electrolytes” ’. No dendrite 
formation was observed on the Al electrode after cycling (Extended 
Data Fig. 5). To maintain a Coulombic efficiency >96%, the cut-off 
voltage of the Al/PG cell (that is, the voltage at which charging was 
stopped) was set at 2.45 V, above which reduced efficiencies were observed 
(see Extended Data Fig. 6a), probably due to side reactions (especially 
above ~2.6 V) involving the electrolyte, as probed by cyclic voltamme- 
try with a glassy carbon electrode against Al (Extended Data Fig. 6b). 
We observed lowered Coulombic efficiency and cycling stability of 
the Al/graphite cell when using electrolytes with higher water contents, 
up to ~7,500 p.p.m. (Extended data Fig. 6c, d), accompanied by obvious 
H, gas evolution measured by gas chromatography (Extended Data 
Fig. 6e). This suggested side reactions triggered by the presence of resi- 
dual water in the electrolyte, with H2 evolution under reducing poten- 
tial on the Al side during charging. Further lowering the water content 


of the ionic liquid electrolyte could be important when maximizing the 
Coulombic efficiency of the Al/graphite cells. 

The Al/PG cell showed limited rate capability with much lower specific 
capacity when charged and discharged at a rate higher than 1 C (Extended 
Data Fig. 7). It was determined that cathode reactions in the Al/PG cell 
involve intercalation and de-intercalation of relatively large chloroa- 
luminate (A1,Cl,) anions in the graphite (see below for XRD evidence 
of intercalation), and the rate capability is limited by slow diffusion of 
anions through the graphitic layers'*. When PG was replaced by nat- 
ural graphite, intercalation was evident during charging owing to dra- 
matic expansion (~50-fold) of the cathode into loosely stacked flakes 
visible to the naked eye (Extended Data Fig. 8a). In contrast, expansion 
of PG foil upon charging the Al/PG cell was not observable by eye 
(Extended Data Fig. 8b), despite the similar specific charging capacity 
of the two materials (Extended Data Fig. 8c). This superior structural 
integrity of PG over natural graphite during charging was attributed to 
the existence of covalent bonding between adjacent graphene sheets in 
PG”, which was not present in natural graphite. Using PG, which has an 
open, three-dimensionally-bound graphitic structure, we prevented 
excessive electrode expansion that would lead to electrode disinteg- 
ration, while maintaining the efficient anion intercalation necessary 
for high performance. 

Because high-rate and high-power batteries are highly desirable for 
applications such as electrical grid storage, the next step in the investi- 
gation was to develop a cathode material that would have reduced ener- 
getic barriers to intercalation during charging’*. We investigated a flexible 
graphitic foam (Fig. 2a), which was made on a nickel foam template by 
chemical vapour deposition””° (see Methods), as a possible material for 
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Figure 2 | An ultrafast and stable rechargeable Al/graphite cell. a, A 

scanning electron microscopy image showing a graphitic foam with an open frame 
structure; scale bar, 300 jim. Inset, photograph of graphitic foam; scale bar, 1 cm. 
b, Galvanostatic charge and discharge curves of an Al/graphitic-foam pouch cell at 
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a current density of 4,000 mA g_'. c, Long-term stability test of an Al/graphitic- 
foam pouch cell over 7,500 charging and discharging cycles at a current density of 
4,000 mA g_'. d, An Al/graphitic-foam pouch cell charging at 5,000 mA g' and 
discharging at current densities ranging from 100 to 5,000mAg ’. 
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ultrafast Al batteries. The graphite whiskers in the foam were 100 jim in 
width (Fig. 2a), with large spaces in between, which greatly decreased 
the diffusion length for the intercalating electrolyte anions and facili- 
tated more rapid battery operation. 

Remarkably, the Al/graphitic-foam cell (in a pouch cell configuration) 
could be charged and discharged at a current density up to 5,000 mA g~ A; 
about 75 times higher (that is, at a 75 C rate, <1 min charge/discharge 
time) than the Al/PG cell while maintaining a similar voltage profile 
and discharge capacity (~60 mA hg ') (Figs 1b and 2b). An impres- 
sive cycling stability with ~ 100% capacity retention was observed over 
7,500 cycles with a Coulombic efficiency of 97 + 2.3% (Fig. 2c). This is 
the first time an ultrafast Al-ion battery has been constructed with 
stability over thousands of cycles. The Al/graphitic-foam cell retained 
similar capacity and excellent cycling stability over a range of charge- 
discharge rates (1,000-6,000 mA g ") with 85—99% Coulombic effi- 
ciency (Extended Data Fig. 9a). It was also found that this cell could be 
rapidly charged (at 5,000 mA g_',in ~1 min) and gradually discharged 
(down to 100 mA g_ ', Fig. 2d and Extended Data Fig. 9b) over ~34 min 
while maintaining a high capacity (~60 mA hg_'). Such a rapid char- 
ging/variable discharging rate could be appealing in many real-world 
applications. 

We propose that simplified Al/graphite cell redox reactions during 
charging and discharging can be written as: 


4AlL,Cl, +3e7 =Al+7AlCl, (1) 


C, + AICI, =C, [AIC] + e7 (2) 


where n is the molar ratio of carbon atoms to intercalated anions in the 
graphite. The balanced AIC], and Al,Cl, concentrations in the electro- 
lyte allowed for an optimal charging capacity at the cathode, with abun- 
dant AlCl, for charging/intercalation in graphite (equation (2)), and 
sufficient Al,Cl,- concentration for charging/electrodeposition at the 
anode (equation (1). 

Ex situ XRD measurement of graphite foil (Fig. 3a) confirmed graphite 
intercalation/de-intercalation by chloroaluminate anions during 
charging/discharging. The sharp pristine graphite foil (002) peak at 
20 = 26.55° (d spacing = 3.35 A) (Fig. 3a) vanished on charging to a 
specific capacity of ~30 mAhg™, while two new peaks appeared at 
~28.25° (d= 3.15 A) and ~23.56° (d~ 3.77 A) (Fig. 3a), with peak 
intensities further increasing on fully charging to ~62 mAhg™’. The 
doublet XRD peak suggested highly strained graphene stacks formed 
on anion intercalation’’. Analysis of the peak separation (see Methods) 
suggested a stage 4 graphite intercalation compound with an interca- 
lant gallery height (spacing between adjacent graphitic host layers) of 
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Figure 3 | Al/graphite cell reaction mechanisms. a, Ex situ X-ray diffraction 
patterns of PG in various charging and discharging states through the second 
cycle. b, In situ Raman spectra recorded for the PG cathode through a 
charge-discharge cycle, showing chloroaluminate anion intercalation/ 
de-intercalation into graphite. c, After calcination of a fully charged 

(62 mAhg_') PGelectrode at 850 °C in air, the sample completely transformed 
into a white foam made of aluminium oxide. Scale bar, 1 cm. 
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~5.7A, indicating that the AlCl, anions (size ~5.28 A; ref. 19) were 
intercalated between graphene layers in a distorted state. Full dischar- 
ging led to the recovery of the graphite peak but with a broad shoulder 
(Fig. 3a), probably caused by irreversible changes in the stacking 
between the graphene layers or a small amount of trapped species. 

In situ Raman spectroscopy was also performed to probe chloroalu- 
minate anion intercalation/de-intercalation from graphite during 
cell charge/discharge (Fig. 3b). The graphite G band (~1,584cm') 
diminished and split into a doublet (1,587 cm” for the Engo(i) mode 
and ~1,608cm"' for the Eyg2(b) mode) upon anion intercalation 
(Fig. 3b)*°, and then evolved into a sharp new peak (~1,636 cm}, 
the G2 band of the E,,,(b) mode, spectrum 2.41 V, Fig. 3b) once fully 
charged. The spectral changes were then reversed upon discharging 
(Fig. 3b), as the typical graphite Raman G band (1584 cm”) was 
recovered when fully discharged (spectrum 0.03 V, Fig. 3b). Similar 
Raman spectra and XRD data were obtained with a graphitic-foam 
cathode (Extended Data Fig. 10a, b). Interestingly, calcination of a fully 
charged PG foil at 850 °C in air (Fig. 3c) yielded a white aluminium 
oxide foam (Extended Data Fig. 10c), confirming the intercalation of 
chloroaluminate anions into the carbon network, which had been evi- 
dently removed oxidatively. 

Lastly, X-ray photoelectron spectra (XPS) and Auger electron spec- 
troscopy (AES) were performed to probe the chemical nature of the 
intercalated species in our graphitic cathodes (see Methods for details). 
To minimize the amount of trapped electrolyte, graphitic foam was used 
and the electrode was thoroughly washed with anhydrous methanol. 
XPS revealed that upon charging pristine graphite, the 284.8 eV C 1s 
peak developed a shoulder at higher energy (~285.9 eV, Fig. 4a), con- 
firming electrochemical oxidation of graphitic carbon by intercalation 
of AICl, anions (equation (2)). Chloroaluminate intercalation was evi- 
dent from the appearance of Al 2p and Cl 2p peaks (Fig. 4b, c). Upon 
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Figure 4 | Chemical probing of a graphitic cathode by XPS and AES. a, XPS 
data of the C 1s peak of a graphitic-foam electrode: pristine, fully charged and 
fully discharged. b, c, XPS data of Al 2p and Cl 2p peaks observed with a 
graphitic-foam electrode: pristine, fully charged and fully discharged. d-g, AES 
mapping images for C, Al and Cl (d, f), and the AES spectrum of the boxed 
regions (e, g) obtained with a fully charged graphitic-foam sample (d, e) anda 
fully discharged graphitic-foam sample (f, g). Scale bars: d, 25 um; f, 10 jum. 
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discharging, the C 1s XPS spectrum of the cathode reverted to that of 
the pristine graphite due to anion de-intercalation and carbon reduc- 
tion (Fig. 4a). Also, a substantial reduction in the Al 2p and Cl 2p signals 
was recorded over the graphite sample (see Fig. 4b, c). The remaining 
Aland Cl signals observed were attributed to trapped/adsorbed species 
in the graphite sample, which was probed by XPS over a large area. Fur- 
thermore, high spatial resolution AES elemental mapping of a single 
graphite whisker in the fully charged graphitic foam clearly revealed Al 
and Cl Auger signals uniformly distributed over the whisker (Fig. 4d, e), 
again confirming chloroaluminate anion intercalation. When fully dis- 
charged, AES mapping revealed anion de-intercalation from graphite 
with much lower Al and Cl Auger signals observed (Fig. 4f, g). These 
spectroscopic results clearly revealed chloroaluminate ion intercala- 
tion/de-intercalation in the graphite redox reactions involved in our 
rechargeable Al cell. 

The Al battery pouch cell is mechanically bendable and foldable (Sup- 
plementary Video 1) owing to the flexibility of the electrode and sepa- 
rator materials. Further, we drilled through Al battery pouch cells during 
battery operation and observed no safety hazard, owing to the lack of 
flammability of the ionic liquid electrolyte in air (see Supplementary 
Video 2). 

We have developed a new Al-ion battery using novel graphitic cath- 
ode materials with a stable cycling life up to 7,500 charge/discharge cycles 
without decay at ultrahigh current densities. The present Al/graphite 
battery can afford an energy density of ~40 Whkg™! (comparable to 
lead-acid and Ni-MH batteries, with room for improvement by opti- 
mizing the graphitic electrodes and by developing other novel cathode 
materials) and a high power density, up to 3,000 W kg" (similar to super- 
capacitors). We note that the energy/power densities were calculated 
on the basis of the measured ~65 mA hg” cathode capacity and the 
mass of active materials in electrodes and electrolyte. Such recharge- 
able Al ion batteries have the potential to be cost effective and safe, and 
to have high power density. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Preparation of ionic liquid electrolytes. A room temperature ionic liquid electro- 
lyte was made by mixing 1-ethyl-3-methylimidazolium chloride ([EMIm]Cl, 97%, 
Acros Chemicals) and anhydrous aluminium chloride (AICI;, 99.999%, Sigma 
Aldrich). [EMIm]Cl was baked at 130°C under vacuum for 16-32 h to remove 
residual water. ([EMIm]A1,Cl,) ionic liquid electrolytes were prepared in an argon- 
atmosphere glove box (both [EMIm]C] and AICI are highly hygroscopic) by mix- 
ing anhydrous AICI, with [EMIm]Cl, and the resulting light-yellow, transparent liquid 
was stirred at room temperature for 10 min. The mole ratio of AICI; to [EMIm]Cl 
was varied from 1.1 to 1.8. The water content of the ionic liquid was determined 
(500-700 p.p.m.) using a coulometric Karl Fischer titrator, DL 39 (Mettler Toledo). 
The predominant anions in basic melts (AICl;/[EMIm]Cl mole ratio <1) are Cl™ 
and AIC, , while in acidic melts (AIC1;/[EMIm]Cl mole ratio >1) chloroalumi- 
nate anions suchas Al,Cl, ,AlsCljg ,and AlCl,3;\ are formed''. The ratio of anions 
to cations in the AICI,/[EMIm]Cl electrolyte was determined using a glass fibre filter 
paper (Whatman GF/D) loaded with a 4-8 jum Au-coated SiO, beads” in a cuvette 
cell (0.35 ml, Starna Cells) with random orientation quartz windows. Then, in the 
glove box, the cuvette cell was filled with AlCl;/[EMIm]Cl = 1.3 (by mole). Raman 
spectra (200-650 cm!) were obtained using a 785-nm laser with 2 cm! resolution. 
Raman data were collected from the surface of the Au-coated SiO, bead so as to 
benefit from surface enhanced Raman*'”’ (Extended Data Fig. 2b). 

Preparation of graphitic foam. Nickel (Ni) foams (Alantum Advanced 
Technology Materials, Shenyang, China), were used as 3D scaffold templates for 
the CVD growth of graphitic foam, following the process reported previously*”°. 
The Ni foams were heated to 1,000 °C in a horizontal tube furnace (Lindberg Blue 
M, TF55030C) under Ar (500 standard cubic centimetres per minute or s.c.c.m.) and 
H, (200 s.c.c.m.) and annealed for 10 min to clean their surfaces and to eliminate a 
thin surface oxide layer. Then, methane (CH,) was introduced into the reaction 
tube at ambient pressure at a flow rate of 10s.c.c.m., corresponding to a concen- 
tration of 1.4 vol.% in the total gas flow. After 10 min of reaction gas mixture flow, 
the samples were rapidly cooled to room temperature at a rate of 300°C min * 
under Ar (500 s.c.c.m.) and H2 (200 s.c.c.m.). The Ni foams covered with graphite 
were drop-coated with a poly(methyl methacrylate) (PMMA) solution (4.5% in 
ethyl acetate), and then baked at 110°C for 0.5h. The PMMA/graphene/Ni foam 
structure was obtained after solidification. Afterwards, these samples were put into 
a 3M HCI solution for 3h to completely dissolve the Ni foam to obtain the 
PMMaA/egraphite at 80°C. Finally, the pure graphitic foam was obtained by 
removing PMMA in hot acetone at 55°C and annealing in NH; (80s.c.c.m.) at 
600 °C for 2h, and then annealing in air at 450 °C for 2 h. The microstructure of the 
graphitic foam was examined by SEM analysis using a FEI XL30 Sirion scanning 
electron microscope (Fig. 2a in the main text). 

Preparation of glassy carbon. Glassy carbon (GC) was usedas the current collector 
in the Swagelok-type cell. 72g phenol (Sigma-Aldrich) and 4.5 ml ammonium 
hydroxide (30%, Fisher Scientific) were dissolved in 100 ml formaldehyde solution 
(37%, Fisher Scientific) under reflux while stirring rapidly. The solution was stirred 
at 90 °C until the solution turned a milk-white colour. Rotary evaporation was used 
to remove the water and get the phenolic resin. The phenolic resin was solidified at 
100 °C ina mould (1/2-inch glass tube), and then carbonized at 850 °C under an Ar 
atmosphere for four hours to obtain the GC rod. The resulting GC rod contributed 
negligible capacity to the cathode (Extended Data Fig. 6b). 

Electrochemical measurements. Prior to assembling the Al/graphite cell in the 
glove box, all components were heated under vacuum at 60 °C for more than 12h 
to remove residual water. All electrochemical tests were performed at 25 + 1 °C. 
A Swagelok-type cell (1/2 inch diameter) was constructed using a ~4 mg PG foil 
(0.017 mm, Suzhou Dasen Electronics Materials) cathode and a 90 mg Al foil 
(0.25 mm, Alfa Aesar) anode. A 1/2 inch GC rod (10 mm) was used as the current 
collector for the PG cathode, and a 1/2 inch graphite rod (10 mm) was used for the 
Al anode. Six layers of 1/2 inch glass fibre filter paper (Whatman 934-AH) were 
placed between the anode and cathode. Then, ~1.0 ml of ionic liquid electrolyte 
(prepared with AICI;/[EMIm]Cl mole ratios of 1.1, 1.3, 1.5 and 1.8) was injected 
and the cell sealed. The Al/PG cell was then charged (to 2.45 V) and discharged (to 
0.01 V) at a current density of 66 mA g! with a MTI battery analyser (BST8-WA) 
to identify the ideal AICl;/[EMIm]Cl mole ratio (Extended Data Fig. 2a). To 
investigate the Coulombic efficiency of the Al/PG cell in AICl;/[EMIm]Cl ~ 1.3 
(by mole) electrolyte, the cell was charged to 2.45, 2.50, 2.55 and 2.60 V, respectively, 
and discharged to 0.4 V at a current density of 66 mA g”' (Extended Data Fig. 6a). 
For long-term cycling stability tests, an Al/PG cell using electrolyte AICI,/ 
[EMIm]Cl~ 1.3 by mole was charged/discharged at a current density of 
66mAg_' (Fig. 1b, c in the main text). To study the rate capability of the Al/ 
PG cell, the current densities were varied from 66 to 264mA. g! (Extended Data 
Fig. 7). Note that we lowered the electrolyte amount to ~0.02 ml per mg of 
cathode material and observed similar cell operation (Extended Data Fig. 4). 
Further decrease in the electrolyte ratio is possible through battery engineering. 
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PG foil was synthesized by pyrolysis of polyimide at high temperature, in which 
some covalent bonding is inevitably generated due to imperfections. Natural graphite 
foil was produced by compressing expanded graphite flakes, leading to stacking of 
natural graphite flakes by Van der Waals bonding between them. Similar battery 
characteristics were observed with PG and graphite foil electrodes, indicating that 
the battery behaviour was derived from the graphitic property of the electrodes 
(Extended Data Fig. 8c). However, since the natural graphite foils are synthesized 
by compressing expanded natural graphite powders without the covalent linkage 
between them, these foils suffered from drastic electrode expansion obvious to the 
naked eye, whereas pyrolytic graphite foils showed no obvious electrode expan- 
sion due to covalency (Extended Data Fig. 8a, b). 

Pouch cells were assembled in the glove box using a graphitic-foam (~3 mg) 
cathode and an Al foil (~70 mg) anode, which were separated by two layers of glass 
fibre filter paper to prevent shorting. Polymer (0.1 mm X 4mm X 5 mm) coated Ni 
foils (0.09 mm X 3 mm X 60 mm in size; MTI corporation) were used as current 
collectors for both anode and cathode. The electrolyte (~2 ml prepared using 
AICl;/[EMIm]Cl = 1.3 by mole) was injected and the cell was closed using a heat 
sealer. The cell was removed from the glove box for long-term cycling stability 
tests, in which the cell was charged/discharged at a current density of 4,000 mA g ' 
(Fig. 2b, c). To determine the rate capability and fast-charge/slow-discharge beha- 
viours of the Al/graphitic-foam cell, various current densities from 100 to 
5,000 mA g' were used (Extended Data Fig. 9 and Fig. 2d). The pouch cell was 
charged to 2.42 V and discharged to a cut-off voltage of 0.5 V to prevent the 
dissolution reaction of Ni foil in the ionic liquid electrolyte. 

Cyclic voltammetry measurements were performed using a potentiostat/galva- 
nostat model CHI 760D (CH Instruments) in either three-electrode or two-electrode 
mode. The working electrode was an Al foil or a PG foil, the auxiliary electrode 
consisted of an Al foil, and an Al foil was used as the reference electrode. Copper 
tape (3M) was attached to these electrodes as the current collector. The copper 
tape was covered by poly-tetrafluoroethylene (PTFE) tape to prevent contact with 
the ionic liquid electrolyte and the part of the copper tape covered by PTFE 
was not immersed in the ionic liquid electrolyte. This prevented corrosion of 
the copper tape during cyclic voltammetry measurements. All three electrodes 
were placed in a plastic (1.5ml) cuvette cell (containing electrolyte AICI;/ 
[EMIm]Cl= 1.3 by mole) in the glove box, and then sealed with a rubber cap 
using a clamp. The scanning voltage range was set from -1.0 to 1.0 V (versus Al) 
for Al foil and 0 to 2.5 V (versus Al) for graphitic material, and the scan rate was 
10 mV s"' (Extended Data Fig. 10d). To investigate the working voltage range of the 
electrolyte without involving cathode intercalation, two-electrode measurement 
was performed by using a GC rod cathode against an Al anode in a Swagelok cell in 
AICI;/[EMIm]Cl (~1.3 by mole) electrolyte. The scanning voltage range was set 
from 0 to 2.9 V at a scan rate of 1OmVs°! (Extended Data Fig. 6b). 

We investigated the Al ion cell operation mechanism and electrode reactions in 
the ionic liquid electrolyte, using the optimal mole ratio of AICl;/[EMIm]Cl = 1.3. 
Using CV (Extended Data Fig. 10d), a reduction wave from -1.0 to -0.08 V (versus 
Al) and an oxidation wave from —0.08 to 0.80 V (versus Al) for the anode were 
observed (Extended Data Fig. 10d, left plot), corresponding to Al reduction/elec- 
trodeposition and oxidation/dissolution’****** during charging and discharging, 
respectively. This was consistent with Al redox electrochemistry in chloroalumi- 
nate ionic liquids'*!>”*° via equation (1) in the main text, and consistent with our 
Raman measurements, which showed both AlCl, and Al,Cl, in the electrolyte 
(Extended Data Fig. 2b). On the graphitic cathode side, an oxidation wave of 1.83 
to 2.50 V (versus Al) anda reduction wave of 1.16 to 2.36 V (versus Al) were observed 
(Extended Data Fig. 10d, right plot) and attributed to graphite oxidation and reduc- 
tion through intercalation and de-intercalation of anions (predominantly AlCl, ~ 
due to its smaller size), respectively. The oxidation voltage range of 1.83 to 2.50 V 
(versus Al, Extended Data Fig. 10d, right plot) was close to the anodic voltage range 
(1.8 to 2.2 V versus Al) of a previously reported dual-graphite cell’* attributed to 
AICI,” intercalation in graphite. The reduction wave range of 1.16 to 2.36 V (versus 
Al) was assigned to the AICI, de-intercalation”*’. The nature of the shoulder in the 
reduction curve of graphite ranging from 2.36 to 1.9 V (Extended Data Fig. 10d, 
right plot) and a higher discharge plateau (2.25 to 2.0 V) of an Al/PG cell upon 
charging (Fig. 1b in the main text) remained unclear, but could be due to different 
stages of anion-graphite intercalation”’. 

XRD and Raman studies of graphite cathodes during charge and discharge. 
For ex situ X-ray diffraction (XRD) study, an Al/ PG cell (in a Swagelok configu- 
ration) was charged and discharged at a constant current density of 66 mA g '. The 
reactions were stopped after 30 mA hg’ charged, fully charged (62 mA hg‘) and 
40 mA hg discharged after charge/discharge capacities were in a stable state. Fully 
charged (62 mA hg’ ') graphitic foam was also prepared. After either the charge or 
the discharge reaction, the graphitic cathode was removed from the cell in the glove 
box. To avoid reaction between the cathode and air/moisture in the ambient atmo- 
sphere, the cathode was placed onto a glass slide and then wrapped in a Scotch tape. 
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The wrapped samples were immediately removed from the glove box for ex situ XRD 
measurements, which were performed on a PANalytical X Pert instrument (Fig. 3a 
in the main text and Extended Data Fig. 10b). 

The periodic repeat distance (Ic), the intercalant gallery height (d;) and the gallery 
expansion (Ad)**”’ can be calculated using 


Io = (di +3.35A) x (n—1) = (Ad +3.35A) x N=1X dots (3) 


where | is the index of (00/) planes oriented in the stacking direction and doy, is the 
observed value of the spacing between two adjacent planes'***”’. The d spacing of 
graphite is 3.35 A. During the charging/anion-intercalation process, the graphite 
(002) peak completely vanished and two new peaks arose. The intensity pattern is 
commonly found for a stage n graphite intercalation compound (GIC), where the 
most dominant peak is the (00n + 1) and the second most dominant peak is the 
(00n + 2)'*?8”°, Based on our experimental data, by increasing the charging state 
from 48-60% charged (30 mA hg’) to the fully charged state (62 mAh g”), the 
distance between the (00” + 1) and (00n + 2) peaks gradually increased, as more 
Al.Cl, anions intercalated. The d spacing values of (00n + 1) and (00n + 2) peaks 
(that is, d(,41) and d,,,+2), respectively) were calculated from XRD data (for example, 
Fig. 3a). By determining the ratio of the di,+2)/d(,+1) peak position and correlating 
these to the ratios of stage pure GICs (that is, ideal cases), the most dominant stage 
phase of the observed GIC can be assigned”*. After assigning the (00/) indices, we 
calculated the intercalant gallery height (d;) through equation (3). 

For simultaneous in situ Raman and galvanostatic charge/discharge reaction 
measurements, a cuvette cell (0.35 ml, Starna Cells) with random orientation 
quartz windows was used. An aluminium foil and a graphitic material (PG or 
graphitic foam) were used as the anode and cathode, respectively. The electrolyte 
was mixed AICI;/[EMIm]Cl = 1.3 (by mole). The electrochemical cell was assembled 
in the glove box following the process mentioned above. Raman spectra were 
obtained (1,500-1,700 cm!) using a HeNe laser (633 nm) with 2 cm | resolution. 
The spectral data were collected after a few successive charge/discharge scans between 
2.45 and 0.01 V at a current density of 66 mA g (PG) (Fig. 3b in the main text) or 
1,000 mA g”™ (graphitic foam) (Extended Data Fig. 10a). 

XPS and AES measurements. Al/graphitic-foam cells were fully charged/discharged 
at a current density of 4,000 mA g~ | ‘Then, the Al/: graphitic-foam cells were trans- 
ferred to the glove box for preparation for XPS and AES analysis. Fully charged/ 


discharged graphitic foams were collected from the pouch cell and washed with 
anhydrous methanol to remove the residual AlCl;/EMIC ionic liquid electrolyte. 
The as-rinsed graphitic foams were attached to a Si wafer and baked at 90 °C for 
10 min to remove residual methanol. The samples were sealed in a plastic pouch to 
avoid contamination by reaction with moisture and oxygen before XPS and AES 
characterization. Auger electron spectra were taken by a PHI 700 Scanning Auger 
Nanoprobe operating at 10kV and 10nA. XPS spectra were collected on a PHI 
VersaProbe Scanning XPS Microprobe (Fig. 4 in the main text). 

TGA measurements. Fully charged PG cathodes were washed with methanol for 
24h to remove the residual AIC1;/EMIC ionic liquid electrolyte. The as-washed PG 
samples were calcined at 850 °C for 3 h in air. The as-calcined samples (white foam) 
were collected, weighed, and analysed by SEM-EDX to study the chemical com- 
position (Extended Data Fig. 10c). SEM and SEM-EDX analyses were performed 
using an FEI XL30 Sirion scanning electron microscope. 

Sample size. No statistical methods were used to predetermine sample size. 
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Extended Data Figure 1 | X-ray diffraction patterns of graphitic cathode 
materials. The natural graphite, pyrolytic graphite (PG) and graphitic foam 
exhibited typical graphite structure, with a sharp (002) X-ray diffraction (XRD) 
graphite peak at 20 ~ 26.55° (d spacing = 3.35 A). 
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Extended Data Figure 2 | Determination of the optimal mole ratio of in a Swagelok-type cell. The Coulombic efficiencies of the cells are shown in 
AICI;/[EMIm]C]l ionic liquid electrolyte. a, Galvanostatic charge and parentheses. b, Raman spectrum of the ionic liquid electrolyte with a mole ratio 


discharge curves of Al/PG cells at a current density of 66 mA g_' in various of AICI;/[EMIm]Cl = 1.3. 
mole ratios (1.1, 1.3, 1.5 and 1.8) of AlCl;/[EMIm]Cl ionic liquid electrolytes 
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Extended Data Figure 3 | Calculated discharging capacities of Al/graphite _ that the entire graphitic material (natural graphite, PG and graphitic foam) 
cells with different masses of graphitic materials. a, Natural graphite foils of _ participated in the cell cathode reaction. 
50 um and 130 ym thickness; b, PG and graphitic foam. These data suggest 
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Extended Data Figure 4 | Galvanostatic charge and discharge curves of an 
AI/PG cell. The cell was constructed with one layer of glass fibre separator and 
0.08 ml of ionic liquid electrolyte, suggesting that the minimum amount of 
electrolyte could be 0.02 ml per mg of PG. This electrochemical study was 
performed in an ionic liquid electrolyte of composition AIC1,/[EMIm]Cl = 1.3 
(by mole) at a current density of 66 mA g * in a Swagelok-type cell. 
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20 cycles: 100 cycles 


Extended Data Figure 5 | Surface observations of an Al anode. a, b, SEM images of the Al anode obtained from two Al/PG cells after 20 (a) and 100 (b) cycles, 
respectively, and indicate no dendrite formation over these cycles. Scale bars, 10 jum. 
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Extended Data Figure 6 | Electrochemical stability of the AICI;/[EMIm]Cl 
ionic liquid electrolyte. a, Galvanostatic curves of Al/PG cells with 

different cut-off charge voltages obtained at 66 mA g ' ina Swagelok-type cell. 
b, Cyclic voltammetry curve of a Al/glassy carbon (GC) cell at 10mVs ‘ina 
Swagelok-type cell. c, d, Stability test of Al/natural graphite pouch cell at 
66mAg " in electrolytes containing water at 7,500-10,000 p.p.m. (c) and 
500-700 p.p.m. (d). The Coulombic efficiencies are respectively 95.2% and 
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98.6%, and the discharge capacities are respectively 54.9 and 61.8mAhg ' 
at the 15 cycle. e, Gas chromatography spectrum of gaseous samples withdrawn 
from Al/graphite cells after 30 cycles using electrolyte with 7,500-10,000 p.p.m. 
H,0 content. The peak found in the retention time at ~0.5 min corresponds 
to hydrogen gas and matches the retention time of pure hydrogen gas used 
for calibration. 
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Extended Data Figure 7 | Rate capability of an Al/PG cell. a, Capacity 
retention of an Al/PG cell cycled at various current densities, showing good 
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efficiency versus current density data of Al/PG cells, indicating the Coulombic 
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efficiency is ~95-97% at current densities of 66-132 mA g *. Error bars, 
standard deviation from the Coulombic efficiency for each current density. 
All electrochemical studies were performed in an ionic liquid electrolyte of 
composition AlCl3/[EMIm]Cl = 1.3 (by mole) in a Swagelok-type cell. 
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A pomegranate-inspired nanoscale design for 
large-volume-change lithium battery anodes 


Nian Liu", Zhenda Lu?', Jie Zhao?, Matthew T. McDowell?, Hyun-Wook Lee?, Wenting Zhao? 


and Yi Cui??* 


Silicon is an attractive material for anodes in energy storage 
devices'?, because it has ten times the theoretical capacity of 
its state-of-the-art carbonaceous counterpart. Silicon anodes 
can be used both in traditional lithium-ion batteries and in 
more recent Li-O, and Li-S batteries as a replacement for the 
dendrite-forming lithium metal anodes. The main challenges 
associated with silicon anodes are structural degradation and 
instability of the solid-electrolyte interphase caused by the 
large volume change (~300%) during cycling, the occurrence 
of side reactions with the electrolyte, and the low volumetric 
capacity when the material size is reduced to a nanometre 
scale*-?, Here, we propose a hierarchical structured silicon 
anode that tackles all three of these problems. Our design is 
inspired by the structure of a pomegranate, where single 
silicon nanoparticles are encapsulated by a conductive carbon 
layer that leaves enough room for expansion and contraction 
following lithiation and delithiation. An ensemble of these 
hybrid nanoparticles is then encapsulated by a thicker carbon 
layer in micrometre-size pouches to act as an electrolyte 
barrier. As a result of this hierarchical arrangement, the 
solid-electrolyte interphase remains stable and spatially con- 
fined, resulting in superior cyclability (97% capacity retention 
after 1,000 cycles). In addition, the microstructures lower the 
electrode-electrolyte contact area, resulting in high Coulombic 
efficiency (99.87%) and volumetric capacity (1,270 mAh cm” °), 
and the cycling remains stable even when the areal capacity 
is increased to the level of commercial lithium-ion batteries 
(3.7 mAh cm“). 

Particle fracture and loss of electrical contact have long been 
identified as primary reasons for the capacity fading of silicon- 
based anodes. Pioneering works have shown that decreasing the 
feature size to the nanoscale allows for the material to withstand 
the large (de)lithiation strains without fracture*!°. However, the 
cycle life of nanosized silicon is still limited due to the unstable 
solid-electrolyte interphase (SEI) on the surface. At the working 
potential of anodes (<0.5 V versus Li/Li‘), the organic electrolyte 
decomposes and forms a thin SEI layer!’”'’. When the silicon 
expands and contracts, the SEI layer deforms and breaks. The for- 
mation of new SEI on the freshly exposed silicon surface causes 
the cell to have poor Coulombic efficiency, with the accumulated 
SEI eventually blocking Li* transport. As a consequence of this 
mechanism, even if most of the silicon active material remains elec- 
trically connected, the capacity decays as the SEI thickens. To 
control SEI formation, an electrolyte blocking layer and an internal 
void space need to be engineered into the structure. Such a design 
has been demonstrated in a few examples, including double- 
walled silicon nanotube and Si-C yolk-shell structures”*®, yielding 
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Figure 1 | Schematic of the pomegranate-inspired design. a,b, Three- 
dimensional view (a) and simplified two-dimensional cross-section view (b) 
of one pomegranate microparticle before and after electrochemical cycling 
(in the lithiated state). The nanoscale size of the active-material primary 
particles prevents fracture on (de)lithiation, whereas the micrometre size of 
he secondary particles increases the tap density and decreases the surface 
area in contact with the electrolyte. The self-supporting conductive carbon 
ramework blocks the electrolyte and prevents SEI formation inside the 
secondary particle, while facilitating lithium transport throughout the whole 
particle. The well-defined void space around each primary particle allows it 
o expand without deforming the overall morphology, so the SEI outside 

he secondary particle is not ruptured during cycling and remains thin. 

¢, Calculated surface area in contact with electrolyte (specific SEI area) and 
he number of primary nanoparticles in one pomegranate particle versus 
its diameter. 


‘Department of Chemistry, Stanford University, Stanford, California 94305, USA, *Department of Materials Science and Engineering, Stanford University, 
Stanford, California 94305, USA, *Stanford Institute for Materials and Energy Sciences, SLAC National Accelerator Laboratory, 2575 Sand Hill Road, 
Menlo Park, California 94025, USA, ‘These authors contributed equally to this work. *e-mail: yicui@stanford.edu 


NATURE NANOTECHNOLOGY | VOL 9 | MARCH 2014 | www.nature.com/naturenanotechnology 187 


© 2014 Macmillan Publishers Limited. All rights reserved 


LETTERS 


: 10.1038/NNANO.2014.6 


Si nanoparticle 


80 nm? @ 


| 
@-— 


Si@SiO, 


Oil phase 


Microemulsion droplet 


Carbon 
Void space v~<e* 


Si@SiO, cluster 


os 


1-10 um 


Si@SiO,@C cluster g& 


Electron counts 


14 109 104 99 94 


Si pomegranate Binding energy (eV) 


—RF-20 
—RF-40 
—RF-100 


Mass retention (%) 
co 
ro 
t 


O 200 400 600 800 
Temperature (°C) 


Figure 2 | Fabrication and characterization of silicon pomegranates. a, Schematic of the fabrication process for silicon pomegranates. b, SEM images of a 
series of silicon nanoparticle clusters with different diameters assembled by the microemulsion approach. The insets show magnified images of the individual 
microparticles. ¢, SEM images of silicon pomegranates showing the micrometre-sized and spherical morphology. d, Magnified SEM image showing the local 
structure of silicon nanoparticles and the conductive carbon framework with well-defined void space between. e, TEM image of one silicon pomegranate 
particle. f, TEM image of the carbon framework after etching away silicon using NaOH. g,h, High-resolution XPS spectra of Si 2p peaks (g) and 
thermogravimetric profiles (h) of silicon pomegranates with different thicknesses of carbon framework. X in 'RF-X' denotes X mg of resorcinol used for 

each 100 mg of Si@SiO, in the RF coating step (see Supplementary Methods). RF-O denotes silicon nanoparticle clusters without carbon. 


significant improvement in cycle life with relatively low areal mass 
loading (<0.2 mg cm”). 

Although nanostructuring has been successful in extending the 
cycle life of silicon, nanostructured electrodes have introduced 
new fundamental challenges, including higher surface area, low 
tap density and generally poor electrical properties due to the 
higher interparticle resistance. The high surface area increases side 
reactions with the electrolyte and lowers the Coulombic efficiency. 
The low tap density leads to low volumetric capacity and a thick 
electrode at high mass loading, which makes it difficult to maintain 
electrical and ionic pathways during cycling. Finally, electrical 
contact between the nanoparticles is easily altered or diminished 
by volume changes during cycling, severely decreasing the cycle 
life of the electrode. To the best of our knowledge, stable cycling 
(100 cycles) with an areal capacity higher than 3 mAh cm 7 has 
not been reported. 

Here, inspired by the structure of a pomegranate fruit, we 
demonstrate a novel secondary structure for silicon anodes 
(Fig. 1a,b). Such a design has multiple advantages: (1) the nanosized 
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primary particle size prevents fracture; (2) the well-defined internal 
void space allows the silicon to expand without changing the sec- 
ondary particle size; (3) the carbon framework functions as an elec- 
trical highway and a mechanical backbone so that all nanoparticles 
are electrochemically active; (4) carbon completely encapsulates the 
entire secondary particle, limiting most SEI formation to the outer 
surface instead of on individual nanoparticles, which not only limits 
the amount of SEI, but also retains the internal void space for silicon 
expansion; and (5) the dilemma of high surface area and low tap 
density introduced when using nanosized primary features is par- 
tially solved. With this design, the specific SEI area (the surface 
area in contact with the electrolyte divided by the mass of silicon) 
decreases from ~90m?g ' for single yolk-shell particles to 
~15 m’g | for 1 ym secondary particles and to only 1.5m? g ' 
for 10 wm particles (Fig. 1c). At the same time, because of the 
space-efficient packing inside the secondary particles, their tap 
density is significantly higher than primary nanosized particles 
packed randomly. In the simplified demonstration shown in 
Supplementary Fig. 1, silicon nanoparticles have a tap density of 
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Figure 3 | Tuning the size of the void space of silicon pomegranates and in situ TEM characterization during lithiation. a-d, SEM images of as-synthesized 
silicon pomegranates with a void space size of 15 nm (a), 20 nm (b), 30 nm (e) and 40 nm (d), respectively. The definition of void space size is shown in 
Fig. 1b. e, Schematic of the in situ TEM device. f, Schematic and time-lapse images of the lithiation of silicon pomegranates with insufficient (~15 nm) void 
space (Supplementary Movie 1). Lithium transports along and across the carbon framework to react with the silicon inside, causing volume expansion. 
Because the void space is insufficient, the carbon framework is ruptured by the expansion of the silicon, and the overall morphology is destroyed. g, Lithiation 
of a silicon pomegranate with sufficient (~40 nm) void space (Supplementary Movie 2); the nanoparticles expand within the carbon framework and the 
carbon framework does not rupture. The secondary particle morphology is therefore intact on lithiation. 


0.15 gcm *, while micrometre-sized clusters have a tap density of 
0.53 gcm *, which represents an increase of 250%. This tap 
density is within the desirable range for electrodes because the 
volume expansion of silicon is up to 300%. 

We have developed a bottom-up microemulsion” approach to 
synthesize highly spherical silicon pomegranate microbeads 
ranging from 500nm to 10ym in diameter (Fig. 2a,b, 
Supplementary Figs 2-5). Silicon nanoparticles are individually 
encapsulated by the carbon framework, with well-defined void 
spaces between the silicon and carbon (Fig. 2c-e) that accommodate 
the volume expansion of the silicon. The carbon framework, 
although only a few nanometres thick, firmly supports the whole 
microbead (Fig. 2f, Supplementary Figs 6 and 7). X-ray photo- 
electron spectroscopy (XPS) results indicate complete coverage of 
carbon (Fig. 2g), which is crucial for blocking the electrolyte and 
limiting the majority of SEI formation to the outer surface of the 
microbeads. Our unique pomegranate design is fundamentally 
different from previous reports of silicon/carbon composite 
secondary particles!!’”8, First, the carbon component functions 
not only as a conducting framework, but also as an electrolyte- 
blocking layer, so SEI forms mostly outside the secondary particle. 
Second, the void spaces inside the secondary particle are well- 
defined and evenly distributed around each nanoparticle, and 
effectively accommodate the volume expansion of the silicon 
without rupturing the carbon shell or changing the secondary 
particle size. The initial well-defined void spaces do not decrease 


the volumetric capacity, because they are designed to be mostly 
occupied in the lithiated state. 

A sufficient and well-defined internal void space is necessary to 
maintain the structural integrity of the silicon anode. To identify the 
critical gap size, we synthesized silicon pomegranates with a series of 
gap sizes by tuning the thickness of the SiO, sacrificial layer 
(Fig. 3a-d). We then performed an in situ transmission electron 
microscopy (TEM) study of their structural change on lithiation. 
The in situ TEM set-up is based on previous studies?®*° and is 
shown schematically in Fig. 3e. For a silicon pomegranate with 
insufficient (~15nm) gap size, as shown in Fig. 3f and 
Supplementary Movie 1, the expanded Li-Si first occupies all the 
void spaces and then ruptures the carbon shell and causes structural 
degradation. This kind of crack formation in a conventional battery 
configuration will cause (1) excessive formation of SEI, (2) loss 
of electrical connection to the active material and (3) whole- 
electrode-level cracking due to accumulated particle shuffling 
across the thickness of the electrode. In the case of sufficient gap 
size (Fig. 3g and Supplementary Movie 2), however, silicon 
expands inside the carbon framework to occupy the void spaces, 
resulting in little change to both the carbon shell and secondary 
particle size. Taking into account the initial size of the nanoparticles 
(~80 nm) and their size distribution, silicon pomegranates with a 
~30-40 nm gap size are the most promising, so this gap size was 
chosen for further electrochemical characterization. The fact that 
the nanoparticles in the middle of the secondary particle are 
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Figure 4 | Electrochemical characterization of silicon pomegranate anodes. All specific capacities of silicon pomegranate anodes are based on the total 
mass of the active materials (silicon and carbon in the pomegranate structure). a, Reversible delithiation capacity for the first 1,000 galvanostatic cycles of 
the silicon pomegranate and other structures tested under the same conditions. Coulombic efficiency is plotted for the silicon pomegranate only. The mass 
loading of the active material was ~0.2 mg cm ~. The rate was C/20 for the first cycle and C/2 for later cycles. (IC=4.2 Ag’ | active material.) b, Voltage 
profiles for the silicon pomegranate plotted for the 3rd, 250th, 500th, 750th and 1,000th cycles. ¢, High areal mass loading test (up to 3.12 mg cm ° active 
material) of silicon pomegranate anodes. All electrodes were first cycled at 0.03 mA cm”? for three cycles and 0.7 mA cm ® for later cycles. d, Typical SEM 
images of silicon pomegranates after 100 cycles. e, Statistical analysis of silicon pomegranate diameter before and after 100 cycles, with averages of 3.1 and 


3.4 um, respectively. 


active upon lithiation suggests effective lithium transport along and 
across the carbon shell and framework. 

The pomegranate design affords remarkable battery perform- 
ance. As shown in Fig. 4a, its reversible capacity reached 
2,350 mAh g | at a rate of C/20 (1C = charge/discharge in 1 h). If 
not mentioned, all reported capacities are based on the total mass 
of silicon and carbon in the pomegranate structure. Because 
silicon is only 77% of the mass of the pomegranate structure, the 
capacity with respect to silicon is as high as 3,050mAhg ’. 
The volumetric capacity based on electrode volume was determined 
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to be 1,270mAhcm *, which is more than twice the 
600 mAh cm * obtained for graphite anodes®. From the 2nd to 
1,000th cycle at a rate of C/2, the capacity retention was more 
than 97%. After 1,000 cycles, over 1,160mAhg ' capacity 
remained, which is more than three times the theoretical capacity 
of graphite. The cycle stability (0.003% decay per cycle) is among 
the best cycling performances of silicon anodes reported to date. 
Furthermore, it was achieved with a conventional polyvinylidene 
fluoride (PVDF) binder, which has been considered a poor binder 
for silicon anodes!%, Under the same conditions, secondary particles 
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without an internal void space (nanoparticle clusters directly coated 
by carbon) demonstrated significant decay after 200 cycles. Bare 
nanoparticles decayed even more quickly. The voltage profiles of 
silicon pomegranate electrodes (Fig. 4b) exhibited typical electro- 
chemical features of silicon, with little change over 1,000 cycles. 
Coulombic efficiency is an indicator of the reversibility of the elec- 
trode reaction. SEI rupture and reformation usually results in 
decreased Coulombic efficiency, especially in later cycles. The 
average Coulombic efficiency from the 500th to 1,000th cycles of 
the silicon pomegranate is as high as 99.87% (Fig. 4a, 
Supplementary Fig. 8). At a relatively slow rate of C/2, this 
Coulombic efficiency is superior to most previous reports. 

It should be noted that many publications only report specific 
capacity normalized by the weight of the active materials, and low 
areal mass loading is frequently used to achieve stable cycling. 
However, high areal mass loading is needed to realize high perform- 
ance based on total cell weight or volume*!. We therefore tested 
thick silicon pomegranate electrodes with active material mass 
loading up to 3.12 mg cm? (silicon and carbon in a pomegranate 
structure). Following deep cycling at 0.03 mA cm ”, the reversible 
areal capacity reached 3.67 mAh cm ” (Fig. 4c), similar to, if not 
higher than, the capacity in a commercial lithium-ion battery cell. 
From the 4th to 100th cycle at a higher rate of 0.7 mA cm ”, the 
capacity retention was as high as 94%. After 100 deep cycles, the 
areal capacity was still above 3 mAh cm”. To the best of our knowl- 
edge, high capacity and stable cycling at such a mass loading level 
have rarely been reported for silicon anodes. At slightly lower 
mass loadings of 2.34 and 1.20mgcm *, 150 and 200 stable 
cycles were achieved. The specific capacity of the high mass 
loading cell (950 mAh g _’) is only slightly lower than that of the 
low mass loading cell shown in Fig. 4a, which indicates that 
almost all the silicon pomegranate is active in the thick electrode. 
Further high loading cells also demonstrated stable cycling 
(Supplementary Fig. 9). Stable cycling of a high mass loading 
cell has strict requirements in terms of structural stability at 
the particle level, because even small changes in particle mor- 
phology could accumulate across the thickness of the electrode 
and cause electrode-level cracking and failure. Excellent 
performance at high mass loading indicates successful design of 
the pomegranate structure. 

There are two interdependent characteristics of the pomegra- 
nate design that enable superior battery performance. The first 
is the internally accommodated volume expansion, which retains 
the structural integrity of the secondary particles and stabilizes 
the SEI on the surface. After 100 deep cycles, the morphology 
of the silicon pomegranates was examined with scanning electron 
microscopy (SEM; Fig. 4d). Nearly perfect spherical micrometre- 
sized particles were observed, with thin and uniform SEI coating 
the surface. After removing the SEI with acid, intact carbon 
shells with silicon inside are clearly visible. Statistical analysis of 
the diameter of the silicon pomegranates before and after 
cycling (Fig. 4e) shows only a 10% increase from 3.1 wm to 
3.44m due to the formation of an SEI layer. The volume 
change here is much less than that of bare alloying anode, and 
is as small as intercalation-type graphite particles®, which is 
crucial for excellent performance in volume change-sensitive, 
high mass loading cells. 

The second characteristic is spatially confined SEI formation. 
Our pomegranate particles are fundamentally different from 
previously reported secondary particles that have open inner 
surfaces!!?78, in that they have an electrolyte-blocking layer that 
limits most of the SEI formation to the surface of the secondary 
particle (Supplementary Figs 10 and 11). Also, the internally accom- 
modated volume expansion ensures that the SEI on the secondary 
particle is thin and stable. This mechanism not only decreases the 
quantity of SEI, but also enables the inner void space to be retained, 


even after many cycles. As shown in Fig. 4e, the thickness of the 
outer SEI layer is only ~150 nm, similar to that of double-walled 
silicon nanotubes”’. In addition, because the SEI is on a micro- 
metre-sized secondary particle, the relative quantity of SEI is signifi- 
cantly reduced compared to that on primary nanostructures 
(Fig. 1c). The thin, stable and spatially confined SEI increases the 
Coulombic efficiency and improves cycle stability. 

An additional advantage of silicon pomegranates is that they exist 
in the form of powders, and their synthesis does not involve any 
complex equipment or processes such as chemical vapour depo- 
sition. Hence, they are entirely compatible with conventional 
slurry-coating manufacturing for lithium-ion battery electrodes. 
However, we note that future work is needed to reduce the cost of 
starting materials such as the silicon nanoparticles to meet commer- 
cial needs. The initial Coulombic efficiency is 82% for 9% carbon, 
but decreases with increasing thickness of the carbon coating 
(75% Coulombic efficiency at 23% carbon) (Fig. 2h, 
Supplementary Fig. 12), and is still lower than state-of-the-art 
graphite anodes due to the high density of lithium trapping sites 
in amorphous carbon (Supplementary Fig. 13). It still needs to be 
improved further, possibly by performing prelithiation® or repla- 
cing amorphous carbon with another material that does not 
irreversibly trap large amounts of lithium. 


Methods 

Synthesis. As illustrated in Fig. 2a, commercial silicon nanoparticles were first 
coated with a SiO, layer using tetraethoxysilane”!. The aqueous dispersion of 
Si@SiO, nanoparticles was then mixed with 1-octadecene containing 0.3 wt% 
emulsifier”® to form water-in-oil emulsions. After evaporation of water at ~95-98 °C, 
the assembled Si@SiO, nanoparticle clusters were collected by centrifugation, 
followed by heat treatment at 550 °C for 1 h in air to remove the organics and 
condense the cluster structures. A low-cost, step-growth polymerization in the 
presence of ammonia then generated a resorcinol-formaldehyde resin (RF) layer to 
wrap the cluster, which was converted into a carbon layer under argon at 800 °C. 
The thickness of the carbon layer could be tuned by changing the added amount of 
resorcinol monomer™. Finally, the SiO, sacrificial layer was removed with 5 wt% HF 
solution to form a void space to accommodate the large volume change of silicon 
material during the charge/discharge process. For more details, see Supplementary 
Materials and Methods. 


In situ TEM. A specialized dual-probe electrical biasing holder (Nanofactory 
Instruments) was used. By biasing the working electrode between —2.5 and —3 V 
versus the counter electrode, Li* ions flow through the oxide/nitride layer and are 
reduced at the working electrode, where they react with carbon and alloy with the 
silicon in the pomegranate structure. The lithiation time of a silicon pomegranate 
structure is less than 30 min (Fig. 3g), corresponding to 2C. The small size of the 
primary nanoparticles and the conductive carbon framework synergistically lead to 
good kinetics, as also evidenced by the rate performance of half cells shown in 
Supplementary Fig. 14. 


Electrochemistry. The battery performance was evaluated by galvanostatic cycling 
of coin cells with the silicon pomegranate structures as the working electrode and 
lithium foil as the counter/reference electrode. The working electrodes were made 
using a typical slurry method with silicon pomegranate powders, carbon black and 
PVDF binder with a mass ratio of 8:1:1; the mass loading of active material (silicon 
and carbon in the pomegranate structure) was ~0.2 mg cm. * All the cells were 
cycled between 0.01 and 1 V versus Li/Li*. To prepare high mass loading electrodes, 
as shown in Fig. 4c, silicon pomegranate microbeads and carbon nanotubes (mass 
ratio 7:3) were dispersed in N-methyl-2-pyrrolidone (NMP) and filtered to make a 
binder-free microbead/carbon nanotube paper, which was cut into discs, with mass 
loading from 1 to 3 mg cm *. The electrolyte was 1.0 M LiPF, in 1:1 wt/wt ethylene 
carbonate/diethyl carbonate, with 1 vol% vinylene carbonate added to improve the 
cycling stability. This type of filtered electrode has a density of ~0.4gcm °. The 
thickness is ~120 jzm when the silicon pomegranate loading is 3.12 mg cm ~. The 
volumetric capacity based on electrode volume is 310 mAh cm_*. This value is lower 
than graphite, but it should be noted that such filtered electrodes are not calendered 
and 30% carbon nanotubes increases the volume of the electrode. They are 
intentionally not calendered to show the excellent and robust intraparticle conductivity 
of the silicon pomegranates without the need for pressure. To examine whether the 
silicon pomegranate could perform well in denser electrodes, we also tested high- 
loading electrodes fabricated using an industry-viable slurry coating and calendering 
process with much fewer carbon nanotubes (5%) or no nanotubes. These electrodes 
also demonstrated stable cycling at high areal capacity (Supplementary Fig. 9), as well as 
much higher volumetric capacity (~900-1,270 mAh cm *) than the practical value 
for graphite anodes (<600 mAh cm’). 
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The role of graphene for electrochemical 
energy storage 


Rinaldo Raccichini'2?, Alberto Varzi2?, Stefano Passerini*** and Bruno Scrosati*** 


Since its first isolation in 2004, graphene has become one of the hottest topics in the field of materials science, and its highly 
appealing properties have led to a plethora of scientific papers. Among the many affected areas of materials science, this ‘graphene 
fever’ has influenced particularly the world of electrochemical energy-storage devices. Despite widespread enthusiasm, it is not 
yet clear whether graphene could really lead to progress in the field. Here we discuss the most recent applications of graphene — 
both as an active material and as an inactive component — from lithium-ion batteries and electrochemical capacitors to emerg- 
ing technologies such as metal-air and magnesium-ion batteries. By critically analysing state-of-the-art technologies, we aim to 
address the benefits and issues of graphene-based materials, as well as outline the most promising results and applications so far. 


lattice, was for a long time considered to be merely a build- 

ing block for carbonaceous materials of other dimensionali- 
ties (that is, graphite, fullerenes and carbon nanotubes)'. Initially 
labelled as an ‘academic material, graphene was thought not to exist 
in a free state until 2004, when Novoselov and co-workers isolated a 
single-atom-thick layer of carbon’. Since then, interest in graphene 
has grown continuously, giving rise to what might be called the ‘gra- 
phene gold rush’. Recently, intense research efforts — motivated 
by graphene’s many appealing properties — have been boosted by 
multimillion-dollar funding from both the European Union and 
China’. Despite its wide range of potential applications* and very 
promising array of features® with respect to other structurally differ- 
ent forms of carbon (Table 1)°*, it is not yet clear whether graphene 
has the potential to revolutionize many aspects of our lives. In recent 
years, a large number of publications have discussed the application 
of graphene in electrochemical energy-storage devices (EESDs). 
However, although such discussions always highlight the advan- 
tages of graphene, they often lack an objective analysis of its limita- 
tions and drawbacks. This leaves us with a number of key questions. 
Will the employment of graphene be limited to niche applications, 
or will next-generation batteries and capacitors be graphene-based? 
Graphene’ properties vary strongly as a function of its production 
method. Hence, which typologies of graphene can be produced with 
today’s available technologies? Could these significantly outperform 
state-of-the-art materials? Furthermore, which performance met- 
rics are more relevant for predicting the potential use of graphene in 
EESDs? This Progress Article aims to address these open questions. 


(5 raphene, a carbon monolayer packed into a 2D honeycomb 


Properties and production methods 

Graphene — a defect-free flat carbon monolayer — is the only basic 
member of a much larger family of 2D carbon forms. As carefully 
reviewed in a Carbon Editorial’, this ‘graphene family’ includes 
materials with very different properties in terms of morphology, lat- 
eral dimensions, number of layers and defects (Tables 2 and 3)}”*. 
Among these characteristics, the presence of defects is the factor 
that primarily affects the quality of the end material® and, conse- 
quently, its electrochemical features. The methods adopted for 
graphene production>®’, the most common shown in Fig. 1, play 
a crucial role in determining the properties of the final product. 


Owing to limited scalability and high production costs, methods 
such as mechanical exfoliation”"°, synthesis on SiC*!° and bottom- 
up synthesis from structurally defined organic precursors*'” neces- 
sarily restrict the use of graphene to fundamental research and niche 
applications, such as touch screens and high-frequency transistors. 
Similarly, chemical vapour deposition of hydrocarbons’, although a 
well-established technique in industry, seems generally unsuitable 
for mass-production of graphene for electrochemical energy stor- 
age because of its high cost, moderate product purity and rather 
low yield'®. Nevertheless, chemical vapour deposition has been 
reported as an efficient method for producing vertically oriented 
graphene nanosheet electrodes'', although the packing density 
of the as-obtained graphene is very low’. Beyond the aforemen- 
tioned techniques, two methods are widely employed for the bulk 
production of graphene: liquid-phase exfoliation, and reduction of 
graphene oxide. In liquid-phase exfoliation, pristine or expanded 
graphite particles, obtained by thermal expansion of graphite 
intercalation compounds (usually known as ‘expandable graph- 
ite’), are first dispersed in a solvent to reduce the strength of the 
van der Waals attraction between the graphene layers. Afterwards, 
an external driving force such as ultrasonication”, electric field" or 
shearing’ is used to induce the exfoliation of graphite into high- 
quality graphene sheets*!*. Unfortunately, the low yield of this pro- 
cess leaves a considerable amount of unexfoliated graphite, which 
must be removed'». Nevertheless, the high scalability and low cost of 
liquid-phase exfoliation’? make it suitable for producing graphene 
in bulk quantities’®. In the second method, graphene oxide (GO), 
a highly defective form of graphene with a disrupted sp?-bonding 
network, is produced by strong oxidation of pristine graphite’”"® 
followed by stirring or ultrasonication in liquid media’. Graphene 
oxide must be reduced in order to restore the 7 network, which is 
the characteristic of conductive graphene”. Chemical, thermal and 
electrochemical processes are commonly employed in this order to 
produce reduced graphene oxide (RGO)!°”°”!. Despite the low-to- 
medium quality of the obtained material due to the presence of both 
intrinsic defects (edges and deformations) and extrinsic defects (O- 
and H-containing groups), these methods allow the production of 
bulk quantities with high yield and contained costs. Although liq- 
uid-phase exfoliation and reduction of GO are the primary methods 
for producing commercially available graphene for EESDs, other 
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Table 1 | Graphene properties compared with other carbonaceous materials. 


Graphene Carbon nanotube Fullerene Graphite 

Dimensions 2 1 0 3 

ybridization sp? Mostly sp? Mostly sp? sp? 

ardness Highest (for single layer) High High High 
Tenacity Flexible, elastic Flexible, elastic Elastic Flexible, non-elastic 
Experimental SSA (m?g™) ~1,500 ~1,300 80-90 ~10-20 
Electrical conductivity (S cm) ~2,000 Structure-dependent Ope Anisotropic: 2-3 x 10**, 6¢ 
Thermal conductivity (W mK") 4,840-5,300 3,500 0.4 Anisotropic: 1,500-2,000*, 5-10° 


*a direction, ‘c direction. 


Table 2 | Dimension-based graphene nomenclature. 


Thickness (n, number of layers) Lateral dimension D (nm) Aspect ratio (length:width) 


1 2<ns<10 D<100 100 <D<10° <10 >10 
Single-layer Few-layer Nano- Micro- -Sheet -Ribbon 
monolayer multilayer -Flake 

-Plate 

-Platelet 
Table 3 | Graphene's structural defect typologies. 
Intrinsic Extrinsic 


(removal or introduction of carbon atoms in graphene’s chemical composition) 


(introduction of non-carbon atoms in graphene’s chemical composition) 


Vacancies 

Edges 
Deformations 
Hybrid structures 


techniques are available (such as carbon nanotube unzipping” or 
direct arc-discharge”). However, owing to their higher costs, these 
techniques remain relatively marginal and thus unsuitable for 
bulk production. 

In their review, Novoselov et. al.* perfectly summarized the cur- 
rent state of affairs: “Graphene will be of even greater interest for 
industrial applications when mass-produced graphene has the same 
outstanding performance as the best samples obtained in research 
laboratories.” As a matter of fact, the large-scale production of ‘out- 
standing performance’ graphene is the most ambitious challenge to 
address before its widespread application®. This aspect is particu- 
larly relevant with regard to the introduction of graphene in EESDs 
for powering millions of electric cars in the near future. 

Over the past few years, many studies have explored graphene- 
based materials for electrochemical energy storage™. In most of 
these, graphene was produced from graphite. As shown in Fig. 2, 
expandable graphite can be thermally expanded and subsequently 
exfoliated to obtain graphene. Pristine graphite can also be directly 
exfoliated to give graphene through liquid-phase methods or, alter- 
natively, oxidized to obtain graphite oxide**”*. The latter, after liquid- 
phase exfoliation, yields GO, which is then reduced to form RGO”. 
This approach is different from other types of application as it is par- 
ticularly useful for energy-storage materials. In fact, although oxida- 
tion introduces defects that cannot be entirely removed during the 
reduction process”, this synthetic pathway facilitates the prepara- 
tion of composites. In contrast with graphene (including RGO), GO 
can be easily dispersed in a wide range of solvents'’. This peculiar- 
ity enables, through different chemical routes, the functionalization 
of GO with electroactive materials (such as conductive polymers 
and metal oxides) to form GO-based composites””. These compos- 
ites can be used as such, or alternatively can be further reduced to 
obtain RGO-composites**. 

Graphene-based materials have been proposed for use in all 
kinds of EESD, either as an active material or an inactive component. 


272 


O, H and other foreign atoms 


Graphene as an active material 

Graphene can be considered to be an active material when it takes 
part in an energy-storage mechanism. This can range from hosting 
ions (such as Li* or Na* in metal-ion batteries) to storing electro- 
static charges on the electrode double-layer (as in electrochemical 
double-layer capacitors, EDLCs), or functioning as a catalyst in 
metal-air batteries. 


Lithium-ion batteries. In lithium-ion batteries (LIBs), Li* ions con- 
tinuously shuttle between a lithium-releasing cathode (commonly a 
layered lithium metal oxide) and a lithium-accepting anode (com- 
monly graphite)”. The amount of ions hosted per gram of material 
determines the capacity — and thus the energy — of the battery. 
Similar to graphite, graphene can be used as an anode for hosting 
Li*, both as such and as a carbonaceous matrix in composites with 
other materials also capable of storing lithium. 


Graphene as an Li* host. As originally suggested by Dahn et al. in 
1995, an anode comprising single layers of graphene can host two 
times as many Li* ions as conventional graphite**!. The storage of 
one lithium ion on each side of graphene results in a Li,C, stoichiom- 
etry that provides a specific capacity of 744 mAh g"! — twice that of 
graphite (372 mAh g")**. This primeval concept of lithium hosting 
in graphene-like carbons was retrieved following the first isolation 
of graphene in 2004’. Differently from graphite, in which lithium 
is intercalated between the stacked layers”, single-layer graphene 
can theoretically store Li* ions through an adsorption mechanism, 
both on its internal surfaces and in the empty nanopores that exist 
between the randomly arranged single layers (accordingly to the 
‘house of cards’ model)***!. Similarly to other disordered carbons, 
such a process mainly takes place at low potentials (<0.5 V versus 
Li/Li*). However, it differs from the characteristic staging behaviour 
of graphite because graphene provides electronically and geometri- 
cally non-equivalent sites*’. As a result of this unique mechanism, 
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the amount of lithium stored by graphene-based anodes is more 
strongly dependent on the production method of both the material 
and the electrode. 

In most reported studies, RGO is the material of choice for 
lithium-ion storage*’. During the first Li* insertion, RGO exhibited 
incredibly high-capacity values of >2,000 mAh g” (ref. 33), which 
is higher than the theoretical capacity of single-layer graphene. 
However, this amazing capacity is not fully released after de-inser- 
tion due to the massive irreversibility of the first lithiation step®. 
This phenomenon, also observed for other Li-ion anode materi- 
als***, can be attributed mainly to the irreversible reduction of the 
electrolyte to form a surface passivation layer on the active particles; 
namely, the ‘solid electrolyte interphase’*’. As shown in Fig. 3a, the 
solid electrolyte interphase strongly depends on the specific surface 
area (SSA) of the active material. Accordingly, the extremely high 
SSA of graphene, when compared with common graphite (Table 1), 
results in a very high initial irreversible capacity® (Fig. 3b). In the 
following de-insertion cycle, graphene displays a high reversible 
capacity, although delivered mostly at potentials of 1-3 V versus 
Li/Li*, which is rather higher than typical graphite values (0-0.4 V 
versus Li/Li*). This leads to the occurrence of a large voltage hyster- 
esis upon insertion and de-insertion of Li* (Fig. 3b), which results 
in poor energy efficiency for cells employing such electrodes. Such 
a drawback, together with the large cathode quantities needed to 
supply the initial charge for the irreversible capacity, makes gra- 
phene-based cells unfeasible. The voltage hysteresis, also observed 
in several nanotube-shaped materials* and high-specific-charge 
carbons”, is caused, among other reasons, by Li storage on defects 
such as edges and/or oxygen- and hydrogen-containing surface 
groups”. It is thus advisable to limit the number of such defects in 
graphene-based anodes, particularly because they are also responsi- 
ble** for the low Coulombic efficiency in the first cycle. In addition, 
the progressive reduction of oxygen-containing groups (for exam- 
ple, in RGO) leads to graphene layer re-stacking, which lowers the 
storage capacity over repeated cycling*’. All of these aspects affect 
the value of the reversible capacity, which, after a few tens of cycles, 
is rarely comparable to that of commercially available graphites®. 

Graphene quality is therefore a crucial issue that must be 
addressed before the graphite in LIBs can be replaced. Even when 
graphene is finally available in large quantities at reasonable cost, 
graphite will probably still be the active material of choice for wide- 
spread hard-case batteries, unless we develop effective strategies to 
prevent initial lithium ion consumption and avoid graphene layer 
re-stacking. In this regard, pre-lithiation®**, controlled surface 
functionalization® and the use of composites® might be promising 
strategies. At the same time, the development of flexible LIBs, which 
require lightweight and ultrathin active materials, could benefit from 
the use of graphene. However, even if different studies demonstrate 
graphene as a promising anode in flexible LIBs, the aforementioned 
drawbacks still represent major obstacles for practical applications”. 


Graphene-based composite anodes. Several composites have recently 
been developed in an effort to overcome the energy-storage limita- 
tions and poor cycling behaviour of bare graphene negative elec- 
trodes*. The addition of electroactive materials, such as metal (or 
metal oxide) nanoparticles, provides reversible alloying (with SnO, 
or Si nanoparticles), insertion (with TiO,) or conversion (with 
Fe,O, or Co,O,) reactions with lithium, thus allowing considerably 
higher storage capacities than those of bare graphene or graphite**’. 
During the composite preparation, graphene can act as a support 
for the growth of electroactive nanostructures that, in turn, hinder 
re-stacking by lowering the van der Waals forces among the layers. 
As a result, graphene-based composites are less affected by agglom- 
eration during electrode preparation, as well as by capacity fading 
during cycling®. Moreover, the extensive and highly conductive car- 
bon matrix established by graphene layers improves the electrical 
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Mechanical exfoliation 


Reduction of 
graphene oxide 


Liquid-phase 
exfoliation 


Bottom-up 
synthesis 


O = None or N/A 


1= Low 
Chemical vapour 2 = Average 
deposition 3 = High 


Figure 1| Schematic of the most common graphene production methods. 
Each method has been evaluated in terms of graphene quality (G), cost 
aspect (C; a low value corresponds to high cost of production), scalability 
(S), purity (P) and yield (Y) of the overall production process. 


conductivity of the composite and buffers eventual volume changes 
taking place in electrodes based on alloying or conversion materi- 
als during cycling”. Despite these promising properties, however, 
graphene-based composites suffer, similarly to bare graphene, from 
the huge irreversible charge consumption of 30-50% during the 
first charge/discharge cycle**. Results achieved so far with gra- 
phene composite anodes are very encouraging towards not only the 
development of high-energy LIBs, but also future applications such 
as wearable EESDs*. Among the proposed composite graphene- 
based materials, some of the most promising in terms of reversible 
capacity are Co,0,/RGO (1,500 mAh g”)*, silicon nanoparticles/ 
RGO (1,150 mAh g")*, N- and S- co-doped RGO (900 mAh g")* 
and SnO,/RGO (700 mAh g")**. Nevertheless, the optimization of 
structural arrangement and weight ratio distribution between the 
composite components are still key issues that must be addressed to 
achieve good electrochemical performance and extended cycle life’. 


Sodium-ion batteries. The development of sodium-ion batteries 
(SIBs), seen as a cheaper alternative to LIBs, is promoting extensive 
research to identify a suitable anode active material because, owing 
to their large ionic radius, Na* ions do not intercalate into graphite*” 
(Fig. 3c). In this regard, graphene seems to be a good candidate as 
an active anode in SIBs. 


Graphene as an Na* host. The use of RGO as an anode material in 
SIBs was first reported in 2013", where it showed promising elec- 
trochemical behaviour, good cycle life and excellent rate capability. 
Such remarkable performance is related to the presence of defects 
(for example, residual oxygen-containing groups), which increase 
the graphene interlayer distance (0.37 nm, compared with 0.34 nm 
in graphite). However, as observed in LIBs, the presence of defects 
represents a serious drawback in term of Coulombic efficiency for 
SIBs"*. Recently, Ding et al. reported the synthesis of different kinds 
of few-layer graphene (produced from biomass precursors) and their 
performance as anode materials in SIBs”. Interestingly, the obtained 
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Figure 2 | The most common synthetic pathways for producing graphene-based materials (GO, RGO, GO- and RGO-based compounds) for use as 


electrode active materials in EESDs. 


graphenes exhibited different Na* ion host mechanisms depending 
on the synthesis temperature (600-1,400 °C). Lower temperatures 
yielded average-quality graphenes with an Na* storage capacity 
similar to that of RGO. In contrast, higher temperatures enabled the 
formation of better quality graphene, with an interlayer spacing of 
0.38 nm and promising insertion performance. Indeed, this report 
discloses one of the best-performing graphene-like materials for 
SIB anodes”’, showing up to 300 mAh g” specific capacity and good 
retention over 200 cycles, even though the Coulombic efficiency for 
the first cycle remains poor. Such results give hope for the success- 
ful employment of graphene in SIBs, insofar as it can compete with 
other recently developed anode materials*. Additionally, the lower 
insertion potential of graphene-based anodes makes it more advan- 
tageous in terms of specific energy. 

Similarly to LIBs, graphene-based composites enable SIBs with 
higher specific capacity, better rate capability and longer cycle life 
than bare graphene*!*. 


Electrochemical capacitors. Electrochemical capacitors (also called 
supercapacitors) exploit fast charge-storage mechanisms to enable 
considerably higher power densities than those available in LIBs or 
SIBs. Electrochemical capacitors can be subdivided into two classes: 
electrochemical double-layer capacitors (EDLCs) and pseudocapaci- 
tors. In EDLCs, the energy is physically stored through the adsorption 
of ions on the surface of the electrodes, whereas in pseudocapacitors, 
electrochemical energy storage is enabled by fast redox reactions 
occurring between the electrode active material and the electrolyte”. 


Electrochemical double-layer capacitors. In EDLCs, the electrode’s 
active materials are electrochemically stable, do not undergo any 
Faradaic processes and, above all, possess large SSAs°*. The amount 
of charge stored per unit mass (F g'), volume (F cm”) or area 
(F cm”) is indeed directly proportional to the surface available for 
the formation of the double layer (that is, the area in contact with 
the electrolyte)*”. In principle, graphene, with its theoretical SSA of 
2,675 m* g" (ref. 8) and capacitance of 550 F g” (ref. 58), would be a 
perfect candidate for boosting the energy density of such devices®. 
However, this does not seem to be the case in practice, as the difficulty 
of even approaching the theoretical SSA of graphene (for instance, 
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average values for RGO are in the range of 300-1,000 m? g*)* results 
in a lower practical gravimetric capacitance (100-270 F g™ and 
70-120 F g“ with aqueous and organic electrolytes, respectively). 
Additionally, spontaneous graphene layer re-stacking, which occurs 
during both electrode manufacturing and cycling, strongly reduces 
the practical surface available for charge storage (Fig. 3d,e). Different 
approaches have been introduced to mitigate these detrimental 
effects. As reported by Ruoff et al., RGO can be chemically activated 
to create an extended 3D meso- and microporous network (with an 
SSA of up to 3,100 m? g") of highly curved graphene walls that pre- 
vent re-stacking during cycling. Such ‘activated graphene’ enables 
high gravimetric capacitances with both organic (166 F g") and ionic 
liquid electrolytes (200 F g*t)® and, moreover, operates across a wide 
temperature range of —50 °C to 80 °C. Alternatively, graphene layer 
re-stacking can be minimized by optimizing the electrode-manufac- 
turing process. In this regard, RGO sheets could be vertically aligned 
with respect to the current collector plane, thus granting better ion 
accessibility and enabling higher packing density. Moreover, high 
and reversible volumetric (171 F cm”) and areal (1.83 F cm”) capac- 
itances in aqueous electrolyte could be obtained”. 

In summary, although activated graphene and vertically aligned 
RGO show promising performance, the large majority of graphene- 
like materials cannot yet compete with the cheaper and well-estab- 
lished activated carbons”. The majority of the results reported for 
graphene-based supercapacitors were obtained with very low den- 
sity electrode materials (for example, aerogels and foams), which 
possess a large number of void spaces (macropores). These pores 
are filled by the electrolyte, thus increasing both the weight and 
volume of the final device to a point where they are unsuitable for 
use in EDLCs***. In contrast, graphene would probably fit in the 
approaching era of small-scale supercapacitors required to power 
the next generation of wearable- and micro-electronic devices™. 


Pseudocapacitors. In pseudocapacitors, the presence in the active 
material of electroactive species such as oxygen-containing func- 
tional groups, conducting polymers or transition metal oxides ena- 
bles higher energy densities with respect to EDLCs**”. Nevertheless, 
pseudocapacitors are inferior to EDLCs in terms of power density 
(limited by the poor electrical conductivity of the active materials) 


NATURE MATERIALS | VOL 14 | MARCH 2015 | www.nature.com/naturematerials 


© 2015 Macmillan Publishers Limited. All rights reserved 


NATURE MATERIALS bor: 10.1038/NMAT4170 


PROGRESS ARTICLE 


a b c ‘ 
A 
Crey/ Cre G r 
ee A raphene fe 
< SS + , Graphene; 
a 2 @Li @Na ' 
— Vv 1 
3 | & + 0 gS | 
B s| ef 
2 s| ~ 9 Gee: 
8 a ' 
g y = Graphite | 
. oO + 1 
ssA Li’ Na ; 
Tee e- ex: 
i T 1 > 
Defects 1,000 2,000 lonic radius 


Specific gravimetric capacity (mAh g") 


d . Layers re-stacking 
Electrode for example in RGO : 
-_ Sach . 2M* +O, + 2e° ——>M,0, 
: S = manufacturing 
a) SS 
a “R, —y 
EF a 

OO SEE Decrease of 

double-layer 


capacitance 


Capacitance 


Electrochemical 


Surface group 
degradation 
(for example in GO) 


Decrease of 
pseudocapacitance 


cycling Voltage 


M = Li, Na 


Figure 3 | Features and limitations of graphene as an active material in different EESDs. a, Graphite and graphene in LIB anodes. Correlation of 


characteristics in terms of defect amount, SSA and ratio between reversible (C...) and irreversible (C, 


) capacity during the first charge/discharge cycle. 
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graphene and graphite. d, Layers re-stacking in graphene during electrode manufacturing and electrochemical cycling. Re-stacking is a serious issue that 
affects the performances of all graphene-based EESDs. e, Generic voltammetric behaviour of graphene-based electrochemical capacitors over prolonged 
cycling. Top: Effect of graphene layers re-stacking (such as in RGO) on the double-layer capacitance. Bottom: Effect of surface group degradation (such as in 
GO) on pseudocapacitance. f, Catalytic effect of graphene defects (vacancies, deformations and presence of surface groups) in metal-air batteries. 


and cycle life. In this regard, graphene-based electrodes could 
be viable candidates for improving the performance of pseudoca- 
pacitors™. Despite its lower electrical conductivity, GO, owing to its 
large number of oxygen-containing groups, has a higher pseudo- 
capacitance than RGO®. However, as previously discussed, these 
groups may negatively affect the electrochemical behaviour of 
the electrode by reducing the cycling stability and reversibility”® 
(Fig. 3e). Various graphene-conducting polymer and graphene- 
metal-oxide composites have also been developed and investi- 
gated for use as pseudocapacitors®”. In these composites, graphene 
provides a support matrix for the growth of the electroactive species 
at the nanoscale, which results in a larger SSA and thus enhances the 
electrochemical performance by increasing electrical conductivity 
and mechanical stability. 

It seems the key to exploiting the full potential of graphene in 
pseudocapacitors relies on the development of composite materials 
that offer the synergistic effect of the graphene substrate and the elec- 
troactive component, along with an optimized spatial orientation of 
the graphene sheets!**?. 


Lithium-air batteries. The growing demand for energy has led to 
the development of new EESDs with higher energy densities than 
metal-ion batteries. In this regard, the lithium-air battery (LAB), 
which offers a theoretical energy density of 5,200 Wh kg” (ref. 67), 
represents one of the best candidates. Although lithium-air chem- 
istry was introduced in 1976, the rechargeability of this system was 
brought to the attention of the scientific community only in 2006 
by Bruce and colleagues. Although different LABs may employ 
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different typologies of electrolyte, they are generally composed of 
metallic lithium and oxygen (or air) as, respectively, the anode and 
cathode. The rechargeability of the system relies on the conversion 
of reduction products (LiO, and, mainly, Li,O,) formed during dis- 
charge (oxygen-reduction reaction), back to the original reagents 
during charge (oxygen-evolution reaction). Unfortunately, the 
entire system suffers from a low energy efficiency, short lifetime and 
low rate capability (discharge capacity of 400 mAh g"™ after 50 cycles 
at a specific current of 100 mA g™)**-”. Reports indicate a maximum 
of only 100 capacity-limited (1,000 mAh g") cycles’. Among the 
various factors that influence the performance of LABs, the mor- 
phology of the air electrode (cathode) is particularly important for 
obtaining high discharge capacity. In fact, the SSA and porosity of the 
air electrode determine the morphology and amount of discharge 
products. It was demonstrated that RGO, with its large SSA, could 
deliver higher capacities than other carbon substrates (for example, 
8,700 mAh g” with respect to 1,000-2,000 mAh g” in the first cycle). 
Defects and functional groups can also play a catalytic role for the 
formation of discharge products” (Fig. 3f). So far, the use of RGO as 
a bare material or substrate for other catalyst” in LAB cathodes has 
improved. performance, although achieving the theoretical energy 
density is still far away. Different aspects are still unclear and further 
studies are needed to demonstrate an effective role of graphene in 
LABs. Further investigations of graphene with stable electrolytes are 
needed before we can assess its effective role in such batteries”. 


Sodium-air batteries. Over the past five years, sodium-air bat- 
teries (SABs), despite having an energy density half that of LABs, 
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have been increasingly proposed owing to their low production cost 
and the availability of the required raw materials”. In contrast with 
lithium, sodium is capable of reversibly forming during discharge 
a stable superoxide (NaO,) with low overpotentials”. This enables 
SABs to cycle with a charge efficiency of 80-90% after the first cycle. 
The formation of peroxide (Na,O,), however, is kinetically hindered 
as it requires a suitable catalyst. RGO has demonstrated, under dry 
air conditions, remarkable catalytic properties towards the forma- 
tion of Na,O, (ref. 74), which are not exhibited by conventional car- 
bon. As reported by Liu et al.™, the micro- and nanostructures of the 
graphene air electrode enable one of the highest specific discharge 
capacities for SABs. These results suggest that RGO can efficiently 
function as a catalyst for both oxygen-reduction and oxygen-evo- 
lution reactions. Nitrogen-doped RGO nanosheets have also been 
investigated in this respect. The defective sites introduced by nitro- 
gen doping enable a more uniform and smaller size distribution of 
the discharge products and, therefore, a higher specific discharge 
capacity with respect to the undoped graphene”. Even though 
this technology is in the very early stages of development, the few 
reports available in this field depict a quite promising scenario for 
graphene in SABs. 


Graphene as an inactive component 

Graphene could play an important role in EESDs, even without 
being actively involved in the electrochemical reaction. Owing to 
its impressive electrical conductivity (Table 1), graphene was pro- 
posed as a conductive agent in metal-ion battery electrodes as well 
as an encapsulating carbon matrix in, for example, lithium-sulphur 
batteries. Besides enabling efficient electron transport, its superior 
thermal conductivity (Table 1) may be advantageous for dissipating 
the heat generated in the case of high current loads or/and abuse 
conditions’. This would result in devices with improved intrinsic 
safety. The variety of structures reported can be classified into six 
different models (Fig. 4a)°. 


Lithium-ion batteries. LiCoO,, LiMn,O, and LiFePO, (hereafter 
referred to as LMO) are some of the most commonly used cath- 
ode materials in LIBs. The cycle life and rate capability of these 
materials are generally limited by their poor electrical conductiv- 
ity. Introducing low-cost conductive additives (for example, carbon 
black) into the composite electrodes commonly solves this issue. 
Nevertheless, owing to their amorphous structure, carbon blacks 
possess a rather low electrical conductivity, with respect to more 
crystalline carbons such as graphene®”®. 

Recently, Kucinskis et al.’ reviewed state-of-the-art graphene- 
based composite cathode materials. Among the vast number of 
reports, most employ GO as a source for the formation of graphene 
conductive networks. Additionally, in a large part of these works, 
GO is reduced to RGO simultaneously with the LMO precur- 
sors (one-pot synthesis) to produce graphene-based composites. 
This approach, which is different from simply mixing the carbon 
additive with the LMO active material during electrode preparation, 
promotes the formation of small-size LMO particles (leading to 
improved Li* diffusion) directly onto the RGO matrix”. Moreover, 
the RGO 3D network is reportedly capable of preventing the disso- 
lution of some LMOs*, thus extending the cycle life of the batteries. 
However, it was also suggested that when RGO is mixed in a manner 
similar to conventional carbon additives during electrode prepara- 
tion, it could negatively affect the Li* mobility, thus worsening the 
electrochemical performance of the composite cathode” (Fig. 4b). 
Regardless of this fact, RGO is generally reported to enhance the 
rate capability of the cathode with respect to conventional carbon 
additives. Depending on the active material, improvements of up to 
160% of the discharge capacity (at the same current rate) have been 
observed”*. Nevertheless, it is not yet clear whether RGO may replace 
carbon blacks, which are much cheaper and easier to handle”. 
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Sodium-ion batteries. As explained previously for the negative 
electrode, the larger size of Na* with respect to Li* restricts the choice 
of available cathode material for SIBs‘’. Several layered oxides with 
promising electrochemical performance have recently been devel- 
oped®. However, like their LIB analogues, they usually possess poor 
electrical conductivity and thus limited rate capability. So far, only 
a few reports are available on graphene-containing composite cath- 
odes for SIBs’”-*°. However, in all cases, the RGO matrix seems to 
enhance the electrical conductivity of the composite, thus improv- 
ing the rate capability compared with bare cathode materials” *. 


Lithium-sulphur batteries. Lithium-sulphur batteries (LSBs), 
through the redox reaction of metallic lithium (anode) and elemen- 
tal sulphur (cathode), could provide a remarkably high theoretical 
specific energy of up to 2,600 Wh kg” (ref. 39). Despite the intrinsic 
advantages of sulphur in terms of low cost, abundance, low toxic- 
ity and high theoretical specific capacity (1,672 mAh g”), LSBs are 
affected by several drawbacks: (1) slow kinetics owing to the low 
electrical conductivity of the redox reaction products; (2) low energy 
efficiency; (3) poor cycle life as a direct result of the dissolution of 
the intermediate reaction products (polysulphides) in the electro- 
lyte; and (4) large volume changes during the electrochemical reac- 
tion®. Graphene has been proposed as a good candidate to address 
these issues because of its high electrical conductivity and capabil- 
ity of trapping the charge/discharge products”. Several studies have 
reported that RGO and GO are suitable substrates for the deposi- 
tion of sulphur micro- and nanoparticles®®. Good encapsulation 
seems to be achieved in both cases; however, the presence of epoxy 
and hydroxyl groups in GO promotes the immobilization of sulphur 
and thus prevents its dissolution”. Promising performance — with 
specific capacities >1,000 mAh g — has also been obtained with 
hybrid graphene-polymer-sulphur composites, although an accept- 
able capacity retention during cycling has not yet been achieved". 
Recently, different graphene-sulphur composites have been syn- 
thesized and tested in LSBs*!*. They exhibit a good performance 
in terms of capacity, Coulombic efficiency and stability during 
cycling*'™, even if the values reported are not higher than those 
obtained with other carbon-encapsulated sulphur cathodes, such as 
ordered mesoporous carbon™. 

From these results, graphene might be a possible candidate for 
encapsulating sulphur on LSB cathodes. However, real advances in 
the field require improvements to the sulphur/graphene interface in 
order to achieve stable electrochemical performance”. 


Developing applications of graphene 

The recent outbreak of graphene in the field of electrochemical energy 
storage has spurred research into its applications in novel systems 
such as magnesium-ion batteries (MIBs), which is one of the newest 
members of the metal-ion battery family. MIBs have been proposed 
as a high-energy-density and environmentally friendly replacement 
for LIBs*. Although research in this field is still at an early stage, 
a few graphene-based composites have recently been proposed as 
MIB cathode materials***’. Although the results obtained so far 
show very poor electrochemical behaviour, they could represent 
the first attempts to use graphene in MIBs. Interestingly, Wang et al. 
have already patented a rechargeable magnesium-ion cell based on 
graphene active materials®. 

The employment of graphene has also been considered for 
the improvement of vanadium redox-flow batteries (VRFBs). 
Patented in 1986*°, VRFBs enable energy storage using V**/V** and 
[VO**]/[VO,*] redox couples as negative and positive acid electrolytic 
solutions, respectively”. Carbon-based materials, such as cloths or 
graphite felts, are used as electrodes because of their electrochemical 
stability and wide operating potential’'. Unfortunately, despite their 
high SSAs, these electrodes do not exhibit satisfactory electrochemi- 
cal properties”. Accordingly, the use of graphene-based materials has 
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Figure 4 | Structural models and a possible drawback of graphene composites. a, Schematic of the different structures of graphene composite electrode 
materials. All models (except where specifically indicated) refer to composites in which graphene and the active material are synthesized through one- 
pot processes. Encapsulated: Single active-material particles are encapsulated by graphene, which acts as either an active (for example, LIB anodes) or 
an inactive (for example, LIB cathodes) component. Mixed: Graphene and active materials are synthesized separately and mixed mechanically during the 


electrode preparation. In this structure, graphene may serve as an inactive conductive matrix (for example, LIB cathodes) or an active material (for example, 
LIB anodes). Wrapped: The active-material particles are wrapped by multiple graphene sheets. This structure well-represents pseudocapacitor electrodes, in 
which graphene is the active material, as well as metal-ion battery cathodes, where graphene is an inactive component. Anchored: This is the most common 
structure for graphene composites, in which electroactive nanoparticles are anchored to the graphene surface. This structure is very relevant for metal-ion 
battery anodes and pseudocapacitors, where graphene serves as an active material, as well as for metal-ion battery cathodes and in LSBs, where graphene 
acts as an inactive component. Sandwich-like model: Graphene is used as a template to generate active material/graphene sandwich structures. This 
graphene-composite model, although not widespread, is used for LIB cathodes. Layered model: Active-material nanoparticles are alternated with graphene 
sheets to form a composite layered structure, which has been proposed for use in metal-ion battery anodes and cathodes. b, Li* paths in carbon black- (left) 


and graphene- (right) based electrodes in the mixed structural model. The figure highlights a possible drawback of graphene in terms of Li* mobility. 


been proposed to improve electrical conductivity, kinetic reversibility 
and electrochemical activity of these electrodes”. Over the past few 
years, a small number of studies have investigated the electrochemi- 
cal properties of GO-based®!, RGO* and RGO-based compos- 
ites*-°° for application in VRFBs. All of these reports show promising 
electrochemical performance for graphene-supported carbon elec- 
trodes, specifically in terms of high peak current density, reduced 
overpotential and decreased charge-transfer resistance. Additionally, 
GO” and commercial graphene” have recently been tested as addi- 
tives in VRFB ion-exchange membranes, with the aim of reducing 
the vanadium permeation and preventing ionic cross-mixing. The 
results achieved so far seem promising when compared with those 
obtained with bare membranes. However, the development of a suc- 
cessful commercial VRFB containing graphene is still far away. 


Conclusions 
It has been ten years since the beginning of the graphene era, and 
the rush to find new applications for this exciting material is more 
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vibrant than ever. However, despite the enormous amount of data 
produced throughout research laboratories across the globe, it is 
still not clear whether graphene has the potential to revolutionize 
many aspects of our lives. This is particularly appropriate for the 
field of electrochemical energy storage, in which ‘graphene fever’ 
has reached rather high levels due to the continuous need for new 
materials that can meet the market’s performance requirements. 
Graphene promises to increase substantially the energy- and 
power-density of practical systems, as well as enable the develop- 
ment of next-generation devices. However, the results so far tend to 
suggest that real breakthroughs are still to come. As was the case for 
many other innovative materials in the past, the main task is to close 
the gap between laboratory-scale research and practical applica- 
tions. The first challenge lies in the production of graphene. Owing 
to its peculiar nature, the electrochemical properties of this mate- 
rial are strictly dependent on its method of production, and so are 
its chances of finding an application in EESDs. Nowadays, the vast 
majority of graphene-based materials are produced by the reduction 
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of GO to RGO. This method is relatively cheap and has the potential 
for scalability — mandatory properties for widespread adoption — 
but introduces both intrinsic and extrinsic defects, which strongly 
affect electrochemical properties. Although for some applications 
the defects function as catalyst sites to improve cell kinetics (for 
example, in lithium-air and sodium air batteries), in other cases 
they strongly reduce the performance. For example, RGO cannot 
compete with the widespread carbonaceous materials commonly 
employed in commercial LIBs. In fact, despite their very promis- 
ing initial performance, RGO electrodes show a limited cycle life” 
compared with well-established graphite electrodes. With respect to 
EDLCs, the limited cycle life and low density of RGO-based elec- 
trodes prevents their transition to the commercial stage. However, 
some strategies for improving the packing density of graphene- 
based materials have been proposed. Nevertheless, the macroporous 
nature of graphene, in general, seriously affects its volumetric energy 
density. In this case, the common practice of evaluating EDLC per- 
formance through gravimetric data might lead to misleading con- 
clusions. Because low-density and few-micrometre-thick electrodes 
are often reported, volumetric data are surely more appropriate. 

In view of the funding and human resources devoted world- 
wide to this unique material, we may expect to see a turnover in 
the not-too-distant future. Some important results support this 
vision. In fact, a growing body of research into graphene-based full 
LIBs*”*8°! is continuing to prove the benefits of graphene for this 
important application. In addition, it has been demonstrated that 
graphene (or RGO) may find its true role when employed in com- 
posite electrodes. Here, graphene layers and electroactive particles 
work symbiotically, with the former providing a stiff and conduc- 
tive matrix, which can buffer eventual volume changes, and the lat- 
ter helping to avoid layer re-stacking. Graphene-based composites 
have, in fact, shown outstanding performance in LIBs**~**, SIBs*!~°, 
pseudocapacitors**** and LSB*"**, Moreover, a few preliminary 
studies into full SIBs°!*> have confirmed graphene exploitation in 
the ‘beyond lithium’ battery generation. Nevertheless, the most 
crucial point is the nano-architecture of the composite. Indeed, if 
proper nanoscale engineering is achieved, such compounds will 
surely play a crucial role in the progress of the field. 

Winning the graphene ‘gold rush’ requires consistent investment 
and commitment from industry and research-funding institutions. 
In this scenario, research scientists are those who occupy the most 
prominent position, by highlighting the benefits and, most impor- 
tantly, addressing the issues that still hinder the large-scale applica- 
tion of graphene in EESDs. 


Received 4 March 2014; accepted 7 November 2014; published 
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Interconnected hollow carbon nanospheres 
for stable lithium metal anodes 


Guangyuan Zheng', Seok Woo Lee?, Zheng Liang’, Hyun-Wook Lee?, Kai Yan?, Hongbin Yao2, 
Haotian Wang?, Weiyang Li?, Steven Chu‘ and Yi Cui#5* 


For future applications in portable electronics, electric vehicles and grid storage, batteries with higher energy storage 
density than existing lithium ion batteries need to be developed. Recent efforts in this direction have focused on high- 
capacity electrode materials such as lithium metal, silicon and tin as anodes, and sulphur and oxygen as cathodes. Lithium 
metal would be the optimal choice as an anode material, because it has the highest specific capacity (3,860 mAh g™) and 
the lowest anode potential of all. However, the lithium anode forms dendritic and mossy metal deposits, leading to serious 
safety concerns and low Coulombic efficiency during charge/discharge cycles. Although advanced characterization 
techniques have helped shed light on the lithium growth process, effective strategies to improve lithium metal anode 
cycling remain elusive. Here, we show that coating the lithium metal anode with a monolayer of interconnected amorphous 
hollow carbon nanospheres helps isolate the lithium metal depositions and facilitates the formation of a stable solid 
electrolyte interphase. We show that lithium dendrites do not form up to a practical current density of 1mA cm™2. The 
Coulombic efficiency improves to ~99% for more than 150 cycles. This is significantly better than the bare unmodified 
samples, which usually show rapid Coulombic efficiency decay in fewer than 100 cycles. Our results indicate that 


nanoscale interfacial engineering could be a promising strategy to tackle the intrinsic problems of lithium metal anodes. 


hen interest in secondary lithium batteries began to 
\/\V sne more than four decades ago! it was clear that, 
to make viable Li metal anodes, two fundamental chal- 
lenges would need to be resolved: (1) accommodating the large 
change in electrode volume during cycling (unlike graphite and 
silicon anodes, where lithiation produces volume changes of 
~10% and 400%, respectively, Li metal is ‘hostless’ and its relative 
volumetric change is virtually infinite); and (2) controlling the reac- 
tivity towards the electrolyte (Li is one of the most electropositive 
elements)*~®. Even today, there is still very little control over the 
thickness, grain size, chemical composition and spatial distribution 
of the solid electrolyte interphase (SEI), which, together, make the 
battery inefficient”*. One problem lies in the fact that the SEI 
layer cannot withstand mechanical deformation and continuously 
breaks and repairs during cycling. As a result, Li metal batteries 
have low Coulombic efficiency (80-90% for carbonate solvents 
and 90-95% for ether solvents)? and low cycle life due to the 
rapid loss of Li and electrolyte in the continuous formation/dissolu- 
tion of the SEI/®. A second problem is that Li deposition is not 
uniform across the electrode surface and can form large dendrites 
that cause short circuiting of the battery!’ !°. Third, reactions 
between the Li metal and the electrolytes are exothermic and large 
surface areas can pose risks of overheating (thermal runaway)". 
Considerable effort has been directed at addressing these pro- 
blems using both solid and liquid electrolytes. As solid electrolytes, 
polymers and ceramics have been developed for their perceived 
ability to mitigate dendrite nucleation!*!© and block their 
growth!”-?°. However, most solid electrolytes have low ionic conduc- 
tivity, resulting in low power output. Moreover, Li polymer batteries 
need to be operated at elevated temperatures to achieve reasonable 
power, at the expense of mechanical  stability?-?*. Ceramic 


solid electrolytes with a framework structure, such as LijgGeP2Sj2 
and garnet type Li,La3;Zr,0,,, have been investigated for 
their high Li ion conductivity (~1 x 10-7 to 1 x 107* S em7)!92324, 
but, like their polymer counterparts, interfacial issues remain 
largely unresolved>”®, 

In the case of liquid electrolytes, a great deal of research has 
focused on using additives’”-”? together with chemical passivation 
of the Li metal surface to reduce electrolyte decomposition*®?!. 
However, the thin films formed on the Li metal using these 
methods consist mainly of Li compounds, which are brittle and 
have limited cohesion with the metal surface*?. Consequently, 
upon Li deposition, the film surface usually cracks as a result of 
volumetric expansion, exposing fresh Li metal for further reactions 
(Fig. la). Lithium dissolution then takes place, creating pits and cre- 
vices with low impedance*, and Li ions flow at the defects, leading 
to rapid growth of metal filaments and dendrites. Stabilizing the 
interface between the Li metal and the electrolyte is therefore key 
in improving the cycling performance of Li metal batteries. 

The ideal interfacial layer for the Li metal anode needs to be 
chemically stable in a highly reducing environment, and also 
mechanically strong. High flexibility is desired to accommodate 
the volumetric expansion of Li deposition without mechanical 
damage. In addition, the ability to control the flow of Li ions with 
the SEI inhomogeneities is essential to ensure uniform Li depo- 
sition**. Here, we describe a flexible, interconnected, hollow amor- 
phous carbon nanosphere coating with the aim of realizing such 
an ideal interfacial layer (Fig. 1b). The advantages of our approach 
are threefold: (1) amorphous carbon is chemically stable when in 
contact with Li metal; (2) the thin amorphous carbon layer does 
not increase the impedance to charge transfer, but has a Young’s 
modulus** of ~200 GPa, high enough to suppress Li dendrite 
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Figure 1 | Schematic diagrams of the different Li anode structures. a, A thin film of SEI layer forms quickly on the surface of deposited Li (blue). Volumetric 
changes during the Li deposition process can easily break the SEI layer, especially at high current rates. This behaviour leads to ramified growth of Li 
dendrites and rapid consumption of the electrolytes. b, Modifying the Cu substrate with a hollow carbon nanosphere layer creates a scaffold for stabilizing 
the SEI layer. The volumetric change of the Li deposition process is accommodated by the flexible hollow-carbon-nanosphere coating. 
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Figure 2 | Fabrication of hollow carbon nanosphere-coated electrode. a, Fabrication process for the hollow carbon nanosphere-modified Cu electrode. Left 
to right: Polystyrene nanoparticles are first deposited onto the Cu substrate; a thin film of amorphous carbon is coated on top of the polystyrene array using 
flash-evaporation of carbon cord; thermal decomposition of the polystyrene template results in the formation of interconnected hollow carbon nanospheres. 
b,c, SEM images of the carbon-coated polystyrene nanoparticle array at low (b) and high (e) magnifications. The slight morphology change of the carbon 
nanospheres to a hexagonal shape could be due to the elevated temperature during the carbon-coating process. d, Digital camera image of the as-fabricated 


hollow carbon nanosphere thin film after removal of the polystyrene template. e, Cross-sectional SEM image of the hollow carbon nanospheres. f, TEM 
image of the hollow carbon nanospheres, with wall thickness of ~20 nm. g, SEM image of the hollow carbon nanosphere thin-film peeled off the Cu 


substrate. Red dashed line: trace of the curvature of bending. 


growth (theoretical calculations have shown that a solid film with 
modulus higher than 6 GPa should be sufficient to this end?°); 
and (3) a hollow nanosphere layer is weakly bound to the metal 
current collector and can move up and down to adjust the avail- 
ability of empty space during cycling. The top surface, formed 
from the hollow carbon nanospheres, is static and forms a stable, 
conformal SEI, while Li metal deposition takes place underneath, 
on the metal current collector. The stable SEI on the carbon nano- 
sphere surface helps reduce the flow of Li ions towards the regions of 
broken SEI on the metal current collector (Supplementary Fig. 1). 


Fabrication of electrode 

We have developed a template synthesis method for fabricating the 
hollow carbon nanopheres, using vertical deposition of polystyrene 
nanoparticles (Fig. 2a). A colloidal multilayer opal structure is 
formed on Cu foil by slowly evaporating a 4% aqueous solution of 
carboxylated polystyrene particles. The highly monodisperse poly- 
styrene nanoparticles form a close-packed thin film with long- 
range order (Fig. 2b)*°. The polystyrene nanoparticles are coated 
with a thin film of amorphous carbon using flash evaporation of 
carbon fibres (Fig. 2c). The slight morphology change of the 
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Figure 3 | Li deposition on a Cu substrate with and without carbon nanosphere modification. a, Top-view SEM image of hollow carbon nanospheres after 
the initial SEI formation process. Inset: the hollow carbon nanosphere structure is preserved after SEI coating. b, Cross-sectional SEM image showing the 
initial deposition of Li metal under carbon nanospheres. ¢, Deposited Li elevates the hollow carbon nanosphere thin film due to weak binding with the Cu 
substrate. The carbon nanosphere coating allows more uniform Li flux, and the deposited Li is columnar rather than dendritic. d, Top-view SEM image 
showing the smooth surface of the electrode with the carbon nanosphere modification. e, For the electrode without carbon nanosphere modification, ramified 
growth of mossy Li/dendrites is clearly observed. f, Corresponding top-view SEM image of the electrode without modification. g, Schematic showing the 
configuration of the in situ TEM cell. Hollow carbon nanospheres are grown on a Cu wire, serving as the working electrode. The counter-electrode consists of 
a small piece of Li metal coated with Li,O (solid electrolyte) on the tip of a W wire. A voltage bias of about -5 V is applied between the two electrodes to 
drive Li deposition. h, TEM images of the Li deposition process on Cu wires decorated with hollow carbon nanospheres taken at different times. Li metal 
approaches the carbon nanospheres from the right, and deposition is observed once a voltage bias is applied. See Supplementary Information for full movie. 


carbon nanospheres to a hexagonal shape could be due to the elev- 
ated temperature during the carbon coating process. The samples 
are then heated in a tube furnace to 400 °C under an inert atmos- 
phere, forming hollow carbon nanopheres on the Cu substrate 
(Fig. 2e). Transmission electron microscope (TEM) characterization 
shows that the carbon wall has a thickness of ~20 nm (Fig. 2f). The 
hemispherical carbon nanospheres are interconnected to form a 
thin film (Supplementary Fig. 2a), which can be peeled off the Cu 
surface easily (Supplementary Fig. 2b). Loose attachment of the 
carbon film to the Cu electrode is important in that it allows the pro- 
tective film to be lifted up, creating space for Li deposition. 
Mechanical flexibility is also important in accommodating the volu- 
metric change of Li deposition and dissipating the stress exerted on 
the Li protection layer during cycling. A digital camera image 
(Fig. 2d) and scanning electron microscopy (SEM) image (Fig. 2g) 
show that the carbon nanosphere thin film can achieve a bending 
radius of ~20 um. 


SEM characterization of Li deposition 

The top surface of evaporated carbon is highly insulating due to the 
large amount of tetrahedral bondings**, while the bulk has a con- 
ductivity of ~7.5Sm‘', as calculated from four-point-probe 
measurements (Supplementary Fig. 3). The low conductivity of 
the evaporated carbon is a result of the lack of long-range order 
in its structure and can reduce direct Li deposition onto the 
carbon*”. The graphitic regions would initially be lithiated and 
form a stable SEI on top of the carbon nanospheres to prevent pen- 
etration of solvent molecules (Supplementary Fig. 4a). Figure 3a 
presents a top view of the hollow carbon nanospheres after SEI 
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formation. The cross-sectional image in the inset to Fig. 3a shows 
that the hollow nanosphere structure is preserved after cycling. 
The electrochemical performance of the as-fabricated anode struc- 
ture was tested using constant current polarization. Figure 3b 
shows the hollow-nanosphere-modified electrode at the beginning 
of Li deposition. Li metal starts to nucleate within the hollow 
carbon nanospheres on the Cu substrate, and, as Li continues to 
deposit, granular Li begins to grow, elevating the hollow carbon 
nanosphere film (Fig. 3c, Supplementary Fig. 4e), confirming 
our design of depositing Li metal underneath the carbon. The 
deposited Li metal has a column-like morphology with a diameter 
of ~3.0 + 0.3 um, with no long filaments or dendrites protruding, 
as is common when Li is deposited on bare Cu (Fig. 3e). The 
drastic change in morphology is a good indication of the lack of 
an SEI layer on the deposited Li, allowing the Li to merge. In the 
control cell, the deposited Li is immediately passivated by the SEI 
layer, which prevents the Li metal from merging, thus significantly 
increasing the surface area. As shown in Fig. 3d, after 50 cycles of 
charge/discharge at 1 mA cm’, the top surface of the electrode 
is relatively uniform, without an overgrowth of Li dendrites. In con- 
trast, direct deposition of Li metal onto a Cu electrode results in a 
very uneven growth of mossy Li, with thin Li filaments clearly 
visible (Fig. 3f). Another control sample tested involved the 
cycling of Li on a flat carbon-coated Cu electrode without nano- 
sphere morphology (Supplementary Fig. 6). Similar to a previous 
study on an amorphous carbon thin-film-coated electrode*’, the 
rigid carbon coating tends to crack upon cycling, and the 
Coulombic efficiency drops rapidly after about 50 cycles. 
Comparison with the flat carbon film therefore highlights the 
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Figure 4 | Electrochemical characterization of the electrodes for Li deposition/dissolution. a, Comparison of cycling performances of the hollow carbon 
nanosphere-modified electrode (solid symbols) and the control Cu electrode (hollow symbols) at different current rates. The amount of Li deposited in each 
cycle is 1 mAh cm~. b, Voltage profiles of the Li deposition/dissolution process with Li metal as the reference electrode at 0.5 mA cm™~. ¢, Comparison of 
the hysteresis of Li deposition/dissolution for the modified electrode and the control electrode with Li metal as the reference/counter-electrode. 


differences in the flexibility of the hollow nanosphere interfacial 
layer and its weak bonding to the Cu current collector. 


In situ TEM observation of Li deposition 

To further understand the Li deposition phenomenon within the 
hollow carbon nanospheres, we carried out in situ transmission 
electron microscopy (TEM) experiments**° using a specialized 
dual-probe biasing TEM holder (Nanofactory Instrument). One 
probe was a Cu metal wire decorated with hollow carbon nano- 
spheres and the other consisted of a W wire with a small piece of 
Li metal attached to the tip (Fig. 3g). Because the Li metal was 
exposed to air for a few seconds when transferring the holder into 
the TEM, a thin layer of Li,O formed on the Li metal. This thin 
oxide layer acts as a solid electrolyte for the nanoscale electrochemi- 
cal cell. By manipulating a piezoelectric motor on the TEM holder, 
the hollow carbon nanospheres came into contact with the lithium 
oxide and a voltage bias was applied to drive the Li ion through the 
oxide solid electrolyte towards the carbon nanospheres. Figure 3h 
presents a series of bright-field TEM images of the carbon nano- 
spheres during the Li deposition process. The experiments show 
that Li begins to deposit on the Cu wire underneath the carbon 
nanospheres immediately after application of the voltage bias. 
After about 25 s of Li deposition, the average thickness of the Li 
increased by 26%. Further deposition for another 6 s increased the 
Li thickness by another 25%. The morphology change of the depos- 
ited Li in lifting up the carbon nanospheres can be seen in the 
Supplementary Movie, which confirms, visually, the concept of 


depositing Li underneath carbon while maintaining the integrity 
of the carbon nanospheres. 


Electrochemical testing of the modified electrodes 

The demonstrated stable interfacial layer of carbon hollow nano- 
spheres opens up the opportunity to improve the Coulombic effi- 
ciency of Li metal anodes. Coulombic efficiency is an important 
parameter for long cycle life and is defined as the ratio of the 
amount of Li that is stripped from the working electrode versus 
the amount that is plated during each cycle. Because the cycle life 
of batteries with Li metal electrodes is related to electrolyte 
decomposition”**, a fair comparison of electrode performance is 
to use a controlled amount of electrolytes. To standardize the elec- 
trochemical performance, ~30 ul electrolytes was used in each coin 
cell test. In the half-cell configuration Li was electrochemically 
deposited (at 1 mAh cm’) from the Li metal counter-electrode 
onto the hollow nanosphere-modified working electrode and then 
stripped away. Here, the Coulombic efficiency reflects the loss on 
the working electrode, because the Li metal counter-electrode has 
excess Li. In cycle life testing these batteries fail due to the depletion 
of electrolytes as a result of reaction with the Li metal®. 
Consequently, the internal resistance increases rapidly in batteries 
that have severe electrolyte decomposition. The reduced electrolyte 
contact with active material also results in a pronounced increase in 
local current density, which subsequently leads to more dendrite 
formation*!. The analysis of electrochemical performance shows 
that the cycling performance of the Li metal working electrodes 
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with the carbon nanosphere coating is significantly improved. 
The Coulombic efficiency is maintained at ~99% for more than 
150 cycles at 0.25 mA cm~™ and ~98.5% at 0.5 mA cm ~ (Fig. 4a). 
In comparison, cells without the hollow carbon nanosphere 
coating show a gradual decrease in Coulombic efficiency, which 
eventually drops to less than 50% after 100 cycles at 0.25 mA cm~ 
and 0.5 mA cm. In the control sample with Cu foil coated with 
flat carbon film only, the performance is also relatively poor, with 
the Coulombic efficiency dropping to below 90% after 70 cycles 
(Supplementary Fig. 6). When tested at a high current density of 
1mAcm™, the Coulombic efficiency of the Li metal working 
electrode with a carbon nanosphere coating is maintained at 
~97.5% for more than 150 cycles, while the control Cu electrode 
showed rapid decay after 100 cycles. Using an alternative testing 
method proposed by Aurbach and co-workers (Supplementary 
Methods)”2, in which 2.5 mA cm™ of Li was initially deposited, 
followed by 10 cycles of deposition/dissolution of 0.5 mAh cm * 
Li, we were able to achieve a Coulombic efficiency of ~99.5% at 
0.5mAcm~, which is higher than the previous results. For 
example, Li metal cycling in ether-based electrolyte usually has a 
Coulombic efficiency of ~95-98% (refs 8,43). There have also 
been attempts to use electrolyte additives***° and other conditions, 
such as the application of high pressure**, to improve Li metal 
performance. Those results usually show low Coulombic efficiency 
but with large variation during cycling (sometimes reaching more 
than 100% for a few cycles)**. The sporadic high Coulombic effi- 
ciency is probably due to the activation of disconnected mossy Li 
from previous cycles. However, the Li metal batteries in the 
present study show consistently stable, high-Coulombic-efficiency 
cycling, which can be attributed to the more uniform Li deposition 
under the hollow carbon nanospheres, more stable SEI formation on 
top of the spheres, and reduction of electrolyte decomposition. 
Impedance spectroscopy revealed that the carbon-nanosphere- 
modified electrode has lower interfacial charge transfer resistance 
than the control electrode due to the preservation of a stable SEI 
layer (Supplementary Fig. 4). The effect of stable SEI formation and 
reduction of electrolyte decomposition can also be seen in the 
reduction of polarization (hysteresis) in the voltage profile during 
Li deposition/dissolution. The Li deposition voltage for the modified 
electrode is approximately -25 mV (versus Li/Li*), whereas that for 
the pristine Cu is ~50mV. The Li dissolution is 25 mV and 
50 mV, respectively (Fig. 4b). For the electrode without modification, 
the voltage hysteresis in the Li deposition/dissolution increases gradu- 
ally as the cycle number increases, with a difference in potential of 
~210 mV after 80 cycles (Fig. 4c). With the hollow carbon nano- 
sphere modification, the hysteresis is much smaller, only ~50 mV 
after 50 cycles. This smaller hysteresis is attributed to the lower 
charge transfer and internal resistance resulting from the thinner 
SEI layer, which are also evident in the cycling of the different 
anodes with LiFePO, cathodes (Supplementary Fig. 5b). The 
hollow-carbon-nanosphere thin film can be transferred onto the Li 
metal foil to be paired with Li-containing cathode materials such as 
LiFePO, for high-energy-density batteries (Supplementary Fig. 5a). 


Conclusions 

We have shown that an interfacial layer of hollow carbon nano- 
spheres allows stable Li metal anode cycling up to a practical 
current density of 1mAcm™~ and with an areal capacity of 
1mAhcm™. The cycling Coulombic efficiency can be highly 
stable at ~99% for more than 150 cycles. Future research is 
needed to develop the application of this approach to practical bat- 
teries (the Coulombic efficiency needs to be improved to >99.9% for 
practical batteries, and alternative electrolyte combinations need to 
be developed to meet different battery chemistries). A viable route to 
this end could be to combine the nanoscale engineering approach 
described here with electrolyte additives. Anodes with interfacial 
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layers on the current collector could be combined with cathodes 
with preloaded Li ions, such as the existing Li metal oxides and 
Li,S. Our work demonstrates that the interfacial nanoscale engineer- 
ing approach can improve Li metal cycling performance. We believe 
that the nanoengineering concepts we have described may be a 
viable route towards Li metal anode batteries and, more specifically, 
to high-energy-density batteries, such as Li-S and Li-O,. 


Methods 

Fabrication of modified electrode. A 100 ul volume of polystyrene nanoparticles 
(0.78 um) aqueous suspension (4 wt/wt%, Thermal Scientific) was dropcast onto a 
Cu foil disk (7/16 inch) and the solvent allowed to evaporate at room temperature 
for ~2 h. The polystyrene nanoparticles then self-assembled into a hexagonally 
close-packed structure. The vertical deposition of colloidal crystal was a result of the 
small density difference between the polystyrene nanoparticles and the solvents. As a 
result, the evaporation velocity of the colloidal solvent exceeded the sedimentation 
velocity of the nanoparticles, allowing the nanoparticles to accumulate at the 
solvent-air interface. As particle concentration increased, lateral capillary 
immersion forces arranged the nanoparticles into hexagonal packing*. To form 
carbon nanospheres, the close-packed polystyrene nanoparticles were first coated 
with amorphous carbon in a carbon coater (EMS150R ES). Carbon fibres were 
used as the evaporation target. The evaporation chamber was pumped down to 

5 x 10°” mbar before an outgassing current of 30 A was passed through the carbon 
fibres. After outgas recovery, a pulse current was passed through the fibre to 

allow flash-evaporation of carbon. The pulse current was set to 60 A for 20 s, with a 
10 s interval between pulses. To remove the polystyrene templates, the sample 

was placed in a tube furnace and heated under Ar at 400 °C for 1.5 h (ramping rate of 
5 °C min '). The hollow carbon nanospheres were plasma-treated to facilitate the 
formation of a stable SEI (Supplementary Fig. 2c), and the electrode was then coated 
with polyvinyldene fluoride (PVDF) by spin-coating 100 ul of 5% PVDF solution in 
N-methyl pyrrolidone (NMP) onto the sample (1,000 r.p.m. for 1 min). NMP 
solvent was removed by placing the samples in a vacuum oven for 3 h at 50 °C. 
To transfer the hollow-carbon-nanosphere thin film onto the Li metal anode, the Cu 
substrate used in the fabrication process was etched away in (NH4)2S20g solution 
and the thin film dried in a vacuum oven before being pressed onto the Li 

metal anode. 


Fabrication of the control electrode. The control electrode was fabricated by first 
spin-coating a thin layer of PVDF onto the Cu current collector. After drying, the 
electrode was assembled in a coin cell with Li metal as both the reference and 
counter-electrode. Pretreatment of the control electrode was carried out as in the 
modified electrode by cycling the battery between 0 and 2 V for 10 cycles. The 
electrode was then tested by depositing and dissolving a controlled amount of Li at 
different current densities. 


Electrochemical testing. Galvanostatic experiments were performed using a 96- 
channel battery tester (Arbin Instrument). The working electrodes were assembled 
in 2032-type coin cells (MTI Corporation) with Li metal (Alfa Aesar) as the 
reference electrode and counter-electrode. The electrolyte was 1 M lithium bis 
(trifluoromethanesulphonyl)imide (LiTFSI) in 1,3-dioxolane and 1,2- 
dimethoxyethane (volume ratio 1:1) with 1% lithium nitrate (LiNO;) and 100 mM 
LipSg additives. The presence of LiNO3 and Li,Ss helps in the formation of a stable 
SEI on the Li metal electrode. For the coloumbic efficiency test, Li metal was used as 
both the working and reference electrodes. The Li metal reference electrode was 
soaked in a 2% LiNO; solution in DOL/DME overnight before assembling the coin 
cells. To standardize the testing, 30 ul electrolytes was used in each coin cell testing. 
The batteries were first cycled between 0 V and 2 V to form a stable SEI on the 
hollow carbon spheres (Supplementary Fig. 2d). Cycling tests were carried out 

by first depositing 1 mAh of Li onto the Cu electrode, followed by Li stripping up to 
2 V. To test the modified anode in a full cell, LifePO, (MTI Corp) at 1 mAh cm” 
was used as the cathode material. Measurements of a.c. impedance were carried out 
using a Bio-Logic VMP3 tester with a frequency range between 0.1 Hz and 1 MHz. 
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The renaissance of lithium metal: 
SolidEnergy’s role in the future of 


lithium batteries 


he pursuit of high energy density is at 

the heart of smartphones, wearable 

gadgets and electric vehicles — devices 
that are quickly becoming extensions of our 
bodies. Lithium, which is the lightest and 
most electronegative metal in the periodic 
table, is anatural choice as anode. However, 
because lithium metal is highly reactive it was 
sidelined during much of the 1990s and 2000s 
and replaced with Li-ion batteries that do not 
contain lithium metal. With recent advances 
in electrolyte, lithium metal is making a strong 
comeback. SolidEnergy was founded in 2012 
during the turmoil of the lithium-battery 
industry crisis to introduce transformational 
changes in battery safety, energy density anda 
new business model. SolidEnergy is developing 
a renaissance technology, electrolyte and 
anode materials for safe and ultra-high-energy- 
density Li-Metal batteries. And because the 


Li-Metal batteries enabled by its materials 

can be manufactured using existing Li-ion 
manufacturing facilities, the company is also 
developing a renaissance business model 

by building an open ecosystem, strategically 
partnering with established companies 

to accelerate the commercialization of 
disruptive technologies in the battery industry. 
SolidEnergy’s mission is to power people's lives, 
whether they are communicating with loved 
ones ona phone or driving with their family in 
an electric car. 


The history of lithium batteries: 
Li-ion versus Li-Metal 

In the past 30 years there has been tremendous 
progress in the lithium-battery industry 

(both Li-Metal and Li-ion), including new 
cathodes, safer coatings on separators and 
better cell engineering'”. There were only a 
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Figure 1 | The history of lithium batteries from an anode perspective. 
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few step-change developments in anodes and 
electrolytes, and the history of lithium batteries 
can be categorized based on anodes (Figure 1). 

Given its low electronegativity (-3.04V 
versus standard hydrogen electrode), low 
specific gravity (0.53 g/cm’) and high lithium 
ion specific storage capacity (3,850 mAh/g), 
lithium metal is the ideal choice as anode. 
Since the 1970s, lithium metal has continued 
to demonstrate its high energy density in 
primary batteries for applications such as 
implantable medical devices, space exploration 
and oil field services. The first rechargeable 
lithium batteries were also demonstrated in 
the laboratory with lithium metal anodes and 
intercalation cathodes such as TiS,, LiCoO,, VO, 
and MoS, by pioneers like Stanley Whittingham 
and John Goodenough. The first commercial 
rechargeable lithium batteries (Gen 0 in Figure 1) 
were developed by Moli Energy using lithium 
metal anodes in the late 1980s and had energy 
densities in the range of 100-200Wh/kg and 
200-300 Wh/L*. 

However, there was a major flaw with the 
old Li-Metal batteries (Gen 0), and that was the 
formation of mossy lithium during charging 
(not a problem for primary batteries). During 
charging, fresh lithium metal is plated onto a 
lithium metal anode. When the electrolyte is an 
organic carbonate liquid, it reacts with lithium 
metal to form mossy, dendrite-like structures 
that can pierce through the separator and 
lead to an internal short and even explosions, 
causing serious safety concerns. In addition, 
the reaction between the electrolyte and 
lithium metal anode forms unstable solid- 
electrolyte-interphase (SEI) layers consuming 
both electrolyte and lithium metal, leading to 
low coulombic efficiency and requiring a thick 
lithium metal anode (typically 3 to 5X excess) 
to achieve an acceptable cycle life (>200), 
resulting in low energy density. 

Given the concerns in both safety and low 
energy density, the industry moved away 
from Li-Metal and adopted a ‘lithium-metal- 
free’ Li-ion system, in which both the cathode 
and anode were intercalation compounds. 
Graphite, which has a lithium ion specific 
storage capacity of only 380 mAh/g (one-tenth 
that of lithium metal) but can allow lithium 
ions to intercalate and de-intercalate freely 
and form relatively stable SEI, replaced lithium 
metal as the new industry standard anode. 
Lithium exists in a Li-ion system only in ion form 
and not in metal form. Sony commercialized 
the first Li-ion batteries (Gen 1) by combining 
a graphite anode with a LiCoO, anode in 1991. 
Although Sony had safety issues of its own, 
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Figure 2 | The renaissance of lithium metal. The number of publications related to Li-Metal batteries 


(including those published on www.nature.com by Nature Publishing Group and in the following 


journals: Journal of the American Chemical Society, Journal of the Electrochemical Society, Journal of 


Power Sources, Electrochimica Acta, and Electrochemistry Communications). 


its Gen 1 Li-ion batteries were a dramatic 
improvement over Gen 0 Li-Metal batteries in 
both safety and energy density (albeit much 

of the improvement in energy density came 
from better cathodes and cell engineering), 
and Gen 0 Li-Metal batteries commercialized by 
Moli quickly faded away. 

Graphite-based Gen 1 Li-ion batteries 
gradually became mainstream throughout 
the 1990s, 2000s and even until today. In 
the mid-2000s, the industry tried to improve 
the specific capacity of graphite anodes by 
incorporating silicon in various forms to 
create silicon-carbon composite anodes with 
lithium ion specific storage capacity of up to 
1,500 mAh/g. Although such silicon-carbon 
composite anodes face challenges with cycle 
life, volume expansion and unstable SEI, they 
are considered Gen 2 Li-ion batteries, offering 
significantly higher energy density both 
gravimetrically (Wh/kg) and volumetrically 
(Wh/L) compared to Gen 1 Li-ion batteries. 


The development of all-solid-state 
Li-Metal batteries 

Efforts to solve the mossy lithium formation in 
lithium metal/liquid electrolyte combination 
that plagued Gen 0 Li-Metal batteries 
continued in the 1990s and 2000s. The focus 
was to replace the liquid electrolyte with a 
solid electrolyte and create an all-solid-state 
Li-Metal battery. The solid electrolytes included 
polymer electrolytes, which were typically 
complexes of lithium salts with lithium ion 
conducting polymers such as polyethylene 


oxide (PEO)°, and ceramic electrolytes, such as 
LiPON, thio-LISICON, La, <Li,<TiO,, Li,P,S,, and 
Li, ,GeP,S,,°’. Both of these solid electrolytes 
are non-flammable and non-volatile and are 
significantly safer than organic carbonate liquid 
electrolyte. 

The solid polymer electrolytes could be 
roll-to-roll solution processed on a large scale 
and were commercialized by companies such 
as Avestor, Bathium and Seeo. The solid ceramic 
electrolytes had to be vacuum deposited, 
an expensive technique more commonly 
found in the semiconductor industry, and 
were commercialized by companies such as 
Infinite Power Solutions, Cymbet and Sakti3. 
However, because of the low conductivity and 
poor electrode-electrolyte interfaces in solid 
electrolytes, solid-state Li-Metal batteries were 
restricted to either high temperature (>80 °C) or 
miro-size thin-film applications, thus could not 
be used in mainstream consumer electronics 
and only achieved limited use in niche electric 
vehicles. 

Although the lithium-metal-free Li-ion 
battery continues to reign supreme, there 
is a limit to its energy density. Li-ion relies 
on intercalation anodes such as graphite 
and silicon-carbon composite, which are 
inert. The intercalation anodes only provide 
host structures for lithium ions but do not 
contribute to energy storage themselves and 
are considered ‘deadweight’ This limits the 
energy density of Li-ion. Li-Metal, on the other 
hand, does not have such deadweight because 
lithium metal does not have any alien host 
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structure and consists purely of lithium ions. 


Li-Metal can be divided into three categories: 


Li/intercalant cathode; Li/sulfur; and Li/air. The 
most ambitious of the three is Li/air, which has 
a potential for >10,000 Wh/kg (approaching 


that of gasoline), but it remains a long way from 


commercialization because of fundamental 
science issues. Li/sulfur has demonstrated 
nearly 500 Wh/kg in commercial cells and has 
been used in aerospace applications, in which 
lightness is paramount. But its volumetric 
energy density (Wh/L) is significantly less 
than Li-ion, and its bulkiness has prevented 
its application in consumer electronics and 
electric vehicles. Li/intercalant cathode has 
demonstrated that with a lithium metal anode 
it can increase the energy density (both Wh/kg 
and Wh/L) by 100% compared to a graphite 
anode and 50% compared to a silicon-carbon 
composite anode (Figure 1). 

Whether the cathode is air, sulfur or high- 


voltage intercalant, the key enabling technology 


to all Li-Metal is the electrolyte — one that 

can enable stable cycling of lithium metal at 
high current density without the formation of 
mossy lithium. Since the late 2000s, there has 
been a renaissance in research on electrolytes 
for Li-Metal, as evidenced in the number of 
publications in this area (Figure 2 shows that 
Li-Metal was a hot topic in the beginning and 
then faded away due to Li-ion and is gaining 
momentum again). These include novel lithium 


salts (including room temperature ionic liquids)® 


that have higher coulombic efficiency on 
lithium, additives that smoothen mossy lithium 
during plating?, new approaches to salt:solvent 
ratio to improve coulombic efficiency", and 
novel materials and engineering techniques 
to build protective layers on lithium™'”. Gen 3 
Li-Metal is a renaissance technology that builds 
upon the all-solid-state batteries developed in 
the 1990s and 2000s, and addresses the high 
temperature and thin-film limitations. 


The story of SolidEnergy 
SolidEnergy Systems Corp. was incorporated 
in spring 2012 to develop and commercialize 
a safe and ultra-high energy density ‘anode- 
free’ battery (Gen 3 Li-Metal) using an 
ultra-thin lithium metal anode (the anode 

is so thin that it is almost anode-free) and a 
combination of solid polymer and ionic liquid 
electrolyte, a concept that was originally 
conceived at Massachusetts Institute of 
Technology (MIT). Unfortunately, 2012 saw 
one of the worst meltdowns in the history 

of the lithium battery industry, and many 
large Li-ion battery and electric-vehicle 
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Figure 3 | Half the size. A SolidEnergy prototype battery (left) has 400 Wh/kg and 1200 Wh/L, twice 
that of an Apple iPhone 6 battery (right). 
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Figure 4 | The ecosystem in which SolidEnergy operates. SolidEnergy innovates at the materials 
level, not the manufacturing level (its contributions are in green). 


manufacturers filed for bankruptcy after 
having raised tens of billions of dollars. It was 
an adverse environment for a young fledging 
company, but SolidEnergy persevered and 
rose out of the ashes. SolidEnergy took a 
worldwide exclusive license from MIT, formed 
strategic partnerships with the reborn A123 
and leading consumer electronics companies, 
and raised venture investment from major 
auto companies. In a little more than one 

year, SolidEnergy demonstrated in a real 2Ah 
prototype battery (not just based on simulation 
results) 400 Wh/kg and 1200Wh/L, twice the 
energy density of an Apple iPhone 6 battery 
(Figure 3), and operating at room temperature, 


all independently verified by third parties and 
at a speed unprecedented in the industry. 


SolidEnergy’s role in the ecosystem 
In addition to developing a renaissance 
technology SolidEnergy also adopted a 
renaissance business model. It has learned 
many important lessons from the perils of its 
predecessors, many of whom started with a 
disruptive technology but sidetracked to focus 
on battery manufacturing. These predecessors 
tried to compete with established companies in 
an extremely capital intensive arena that is red- 
hot crowded and with severe overcapacity. At 
the same time, established companies were too 
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Figure 5 | SolidEnergy’s roadmap for the anode-free battery (Gen 3 Li-Metal). 


conservative to put serious resources behind 
disruptive technologies. As a result, the lithium- 
battery industry has seen many incremental 
improvements, but rarely disruptive ones. 

SolidEnergy disciplinarily focuses on the area 
in which it can create the most value, which 
is key enabling battery materials. It innovates 
at the materials level, not the manufacturing 
level, but the batteries enabled by its materials 
can be manufactured using existing Li-ion 
manufacturing capability. This allows it to avoid 
massive infrastructure investment duplicating 
what the industry already has, leverage an 
established ecosystem, and efficiently capture 
the highest value. 

Figure 4 shows SolidEnergy’s business 
model and how it fits in the overall ecosystem. 
It acquires and processes raw materials from its 
strategic partners in chemical and equipment 
manufacturing. The company develops two 
key enabling battery materials: the anode, 
which consists of anode-lyte coating on 
lithium/copper; and the cathode-lyte (anode- 
lyte and cathode-lyte are two components 
of electrolyte), which consists of salts, ionic 
liquids and other chemicals. These two battery 
materials are then supplied to the battery 
manufacturers with a separator and a cathode 
to form complete batteries. SolidEnergy does 
not manufacture batteries, instead it provides 
key enabling battery materials. 

SolidEnergy works with battery end users 
such as consumer electronics and auto 
companies to finalize the battery design, and 
with battery manufacturers to develop the 
engineering and manufacturing processes. 
This open ecosystem integrates SolidEnergy’s 
anode-free battery design and materials 
seamlessly into the end user experience while 
minimizing infrastructure investment and 
industry redundancy. 

Many of SolidEnergy’s predecessors 
failed because they targeted exclusively 


electric vehicles and ignored consumer 
electronics. Electric vehicles have the toughest 
performance and cost requirements and the 
longest development cycle. For example, 
electric vehicles require eight-year warranty 
and smartphones are replaced every two 
years. But most of today’s successful electric- 
vehicle battery manufacturers built strong 
foundations in consumer electronics first. 

The fast innovation cycle and relatively 

easier performance and cost requirements in 
consumer electronics make it an ideal platform 
to develop, demonstrate and optimize a new 
battery technology before it becomes mature 
enough for electric vehicles. 


SolidEnergy’s vision for the future 
The commercialization roadmap for 
SolidEnergy’s anode-free battery touches 
several markets, including drones, watches & 
wearables, smartphones and electric vehicles 
(Figure 5). The sequence increases in both 
market size and entry difficulty. Drones and 
watches & wearables value high energy 
density but have relatively small capacity and 
are great beachheads to test a new battery 
technology. Smartphones and electric vehicles 
value scale and cost in addition to high energy 
density, and demand much higher capacity, 
but are fertile markets in which to build a large 
company in the long term. 

Although these markets may seem 
dramatically different and the batteries may 
have different form factors and different 
manufacturing processes, SolidEnergy’s key 
enabling materials are largely transferrable 
throughout and can ride the wave of 
innovation in cathodes and other aspects, 
giving them flexibility and longevity. 
SolidEnergy is introducing its materials for 
drones this year; watches & wearables in 
2016; smartphones in 2017; and electric 
vehicles in 2018. 
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It is now 2015. Li-ion can no longer satisfy 
our insatiable thirst for higher energy density, 
and even with the best engineering it is 
approaching its theoretical limit. If we refuse 
to let battery technology limit our desire for 
smarter devices and cleaner transportation, 
we must focus on the next practical solution, 
Li-Metal. Although challenges to Li-Metal 
remain daunting, the current renaissance in 
technology and new business model give us 
confidence that Li-Metal will soon reclaim its 
rightful spot in the industry. 
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SolidEnergy has developed a safe and ultra-high-energy 
density Li-Metal battery. It will introduce this battery to 
drones in 2015, watches in 2016, smart phones in 2017, and 
electric cars in 2018. Its mission is to power people’s lives. 


Visit us online at solidenergysystems.com 


